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FIELD OF THE INVENTION 

The invention concerns the human g35030 gene, and g35030 polynucleotides, 
polypeptides, as well as biallelic markers localized in the g35030 gene and in the human 
chromosome 13q31-q33 region. The invention also concerns the association established 
10 between schizophrenia and bipolar disorder and the biallelic markers and the g3503O gene and 

nucleotide sequences. The invention provides means to identify compounds useful in the 
treatment of schizophrenia, bipolar disorder and related diseases, means to determine the 
predisposition of individuals to said disease as well as means for the disease diagnosis and 
prognosis. 

15 

BACKGROUND OF THE INVENTION 

Advances in the technological armamentarium available to basic and clinical 
investigators have enabled increasingly sophisticated studies of brain and nervous system 
function in health and disease. Numerous hypotheses both neurobiological and 

20 pharmacological have been advanced with respect to the neurochemical and genetic 

mechanisms involved in central nervous system (CNS) disorders, including psychiatric 
disorders and neurodegenerative diseases. However, CNS disorders have complex and poorly 
understood etiologies, as well as symptoms that are overlapping, poorly characterized, and 
difficult to measure. As a result future treatment regimes and drug development efforts will be 

25 required to be more sophisticated and focused on multigenic causes, and will need new assays 

to segment disease populations, and provide more accurate diagnostic and prognostic 
information on patients suffering from CNS disorders. 

Genetic Analysis of Complex Traits 

30 Until recently, the identification of genes linked with detectable traits has relied mainly 

on a statistical approach called linkage analysis. Linkage analysis is based upon establishing a 
correlation between the transmission of genetic markers and that of a specific trait throughout 
generations within a family. Linkage analysis involves the study of families with multiple 
affected individuals and is useful in the detection of inherited-traits, which are caused by a 

35 single gene, or possibly a very small number of genes. But, linkage studies have proven 
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difficult when applied to complex genetic traits. Most traits of medical relevance do not follow 
simple Mendelian monogenic inheritance. However, complex diseases often aggregate in 
families, which suggests that there is a genetic component to be found. Such complex traits are 
often due to the combined action of multiple genes as well as environmental factors. Such 
5 complex trait, include susceptibilities to heart disease, hypertension, diabetes, cancer and 

inflammatory diseases. Drug efficacy, response and tolerance/toxicity can also be considered as 
multifactoral traits involving a genetic component in the same way as complex diseases. 
Linkage analysis cannot be applied to the study of such traits for which no large informative 
families are available. Moreover, because of their low penetrance, such complex traits do not 

10 segregate in a clear-cut Mendelian manner as they are passed from one generation to the next. 

Attempts to map such diseases have been plagued by inconclusive results, demonstrating the 
need for more sophisticated genetic tools. 

Knowledge of genetic variation in the neuronal and endocrine systems is important for 
understanding why some people are more susceptible to disease or respond differently to 

15 treatments. Ways to identify genetic polymorphism and to analyze how they impact and predict 

disease susceptibility and response to treatment are needed. 

Although the genes involved in the neuronal and endocrine systems represent major 
drug targets and are of high relevance to pharmaceutical research* we still have scant knowledge 
concerning the extent and nature of, sequence variation in these genes and their regulatory 

20 elements. In the case where polymorphisms have been identified the relevance of the variation 

is rarely understood. While polymorphisms hold promise for use as genetic markers in 
determining which genes contribute to muitigenic or quantitative traits, suitable markers and 
suitable methods for exploiting those markers have not been found and brought to bare on the 
genes related to disorders of the brain and nervous system. 

25 The basis for accomplishment of these goals is to use genetic association analysis to 

detect markers that predict susceptibility for these traits. Recently, advances in the fields of 
genetics and molecular biology have allowed identification of forms, or alleles, of human genes 
that lead to diseases. Most of the genetic variations responsible for human diseases identified so 
far, belong to the class of single gene disorders. As this name implies, the development of 

30 single gene disorders is determined, or largely influenced, by the alleles of a single gene. The 

alleles that cause these disorders are, in general, highly deleterious (and highly penetrant) to 
individuals who carry them. Therefore, these alleles arid their associated diseases, with some 
exceptions, tend to be very rare in the human population. In contrast, most common diseases 
and non-disease traits, such as a physiological response to a pharmaceutical agent, can be 

35 viewed as the result of many complex factors. These can include environmental exposures 
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(toxins, allergens, infectious agents, climate, and trauma) as well as multiple genetic factors. 

Association studies seek to analyze the distributions of chromosomes that have 
occurred in populations of unrelated (at least not directly related) individuals. An assumption in 
this type of study is that genetic alleles that result in susceptibility for a common trait arose by 
5 ancient mutational events on chromosomes that have been passed down through many 

generations in the population. These alleles can become common throughout the population in 
part because the trait they influence, if deleterious, is only expressed in a fraction of those 
individuals who carry them. Identification of these "ancestral" chromosomes is made difficult 
by the fact that genetic markers are likely to have become separated from the trait susceptibility 

1 0 allele through the process of recombination, except in regions of DNA which immediately 

surround the allele. The identities of genetic markers contained within the fragments of DNA 
surrounding a susceptibility allele will be the same as those from the ancestral chromosome on 
which the allele arose. Therefore, individuals from the population who express a complex trait 
might be expected to carry the same set of genetic markers in the vicinity of a susceptibility 

1 5 allele more often than those who do not express the trait; that is these markers will show an 

association with the trait. 
Schizophrenia 

Schizophrenia is one of the most severe and debilitating of the major psychiatric 
diseases. It usually starts in late adolescence or early adult life and often becomes chronic and 

20 disabling. Men and women are at equal risk of developing this illness; however, most males 

become ill between 16 and 25 years old, while females develop symptoms between 25 and 30. 
People with schizophrenia often experience both "positive" symptoms (e.g., delusions, 
hallucinations, disorganized thinking, and agitation) and "negative" symptoms (e.g., lack of 
drive or initiative, social withdrawal, apathy, and emotional unresponsiveness). 

25 Schizophrenia affects 1% of the world population. There are an estimated 45 million 

people with schizophrenia in the world, with more than 33 million of them in the developing 
countries. This disease places a heavy burden on the patient's family and relatives, both in 
terms of the direct and indirect costs involved and the social stigma associated with the illness, 
sometimes over generations. Such stigma often leads to isolation and neglect. 

30 Moreover, schizophrenia accounts for one fourth of all mental health costs and takes up 

one in three psychiatric hospital beds. Most schizophrenia patients are never able to work. The 
cost of schizophrenia to society is enormous, In the United States, for example, the direct cost 
of treatment of schizophrenia has been estimated to be close to 0.5% of the gross national 
product. Standardized mortality ratios (SMRs) for schizophrenic patients are estimated to be 

35 two to four times higher than the general population, and their life expectancy overall is 20 % 
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shorter than for the general population. The most common cause of death among schizophrenic 
patients is suicide (in 1 0 % of patients) which represents a 20 times higher risk than for the 
general population. Deaths from heart disease and from diseases of the respiratory and 
digestive system are also increased among schizophrenic patients. 
5 Bipolar Disorder 

Bipolar disorders are relatively common disorders with severe and potentially disabling 
effects. In addition to the severe effects on patients' social development, suicide completion 
rates among bipolar patients are reported to be about 15%. 

Bipolar disorders are characterized by phases of excitement and often including 

10 depression; the excitement phases, referred to as mania or hypomania, and depression can 

alternate or occur in various admixtures, and can occur to different degrees of severity and over 
varying time periods. Because bipolar disorders can exist in different forms and display 
different symptoms, the classification of bipolar disorder has been the subject of extensive 
studies resulting in the definition of bipolar disorder subtypes and widening of the overall 

15 concept to include patients previously thought to be suffering from different disorders. Bipolar 

disorders often share certain clinical signs, symptoms, treatments and neurobiological features 
with psychotic illnesses in general and therefore present a challenge to the psychiatrist to make 
an accurate diagnosis. Furthermore, because the course of bipolar disorders and various mood 
and psychotic disorders can differ greatly, it is critical to characterize the illness as early as 

20 possible in order to offer means to manage the illness over a long term. 

Bipolar disorders appear in about 1.3% of the population and have been reported to 
constitute about half of the mood disorders seen in a psychiatric clinic. Bipolar disorders have 
been found to vary with gender depending of the type of disorder; for example, bipolar disorder 
I is found equally among men and women, while bipolar disorder II is reportedly more common 

25 in women. The age of onset of bipolar disorders is typically in the teenage years and diagnosis 

is typically made in the patient's early twenties. Bipolar disorders also occur among the elderly, 
generally as a result of a medical or neurological disorder. 

The costs of bipolar disorders to society are enormous. The mania associated with the 
disease impairs performance and causes psychosis, and often results in hospitalization. This 

30 disease places a heavy burden on the patient's family and relatives, both in terms of the direct 

and indirect costs involved and the social stigma associated with the illness, sometimes over 
generations. Such stigma often leads to isolation and neglect. Furthermore, the earlier the onset, 
the more severe are the effects of interrupted education and social development. 

The DSM-IV classification of bipolar disorder distinguishes among four types of 

35 disorders based on the degree and duration of mania or hypomania as well as two types of 
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disorders which are evident typically with medical conditions or their treatments, or to 
substance abuse. Mania is recognized by elevated, expansive or irritable mood as well as by 
distractability, impulsive behavior, increased activity, grandiosity, elation, racing thoughts, and 
pressured speech. Of the four types of bipolar disorder characterized by the particular degree 
5 and duration of mania » DSM-IV includes: 

- bipolar disorder I, including patients displaying mania for at least one week; 

- bipolar disorder II, including patients displaying hypomania for at least 4 days, 
characterized by milder symptoms of excitement than mania, who have not previously 
displayed mania, and have previously suffered from episodes of major depression; 

10 - bipolar disorder not otherwise specified (NOS), including patients otherwise 

displaying features of bipolar disorder II but not meeting the 4 day duration for the excitement 
phase, or who display hypomania without an episode of major depression; and 

- cyclothymia, including patients who show numerous manic and depressive symptoms 
that do not meet the criteria for hypomania or major depression, but which are displayed for 

15 over two years without a symptom-free interval of more than two months. 

The remaining two types of bipolar disorder as classified in DSM-VI are disorders 
evident or caused by various medical disorder and their treatments, and disorders involving or 
related to substance abuse. Medical disorders which can cause bipolar disorders typically 
include endocrine disorders and cerebrovascular injuries, and medical treatments causing 

20 bipolar disorder are known to include glucocorticoids and the abuse of stimulants. The disorder 

associated with the use or abuse of a substance is referred to as "substance induced mood 
disorder with manic or mixed features". 

Diagnosis of bipolar disorder can be very challenging. One particularly troublesome 
difficulty is that some patients exihibit mixed states, simultaneously manic and dysphoric or 

25 depressive, but do not fall into the DSM-IV classification because not all required criteria for 

mania and major depression are met daily for at least one week. Other difficulties include 
classification of patients in the DSM-IV groups based on duration of phase since patients often 
cycle between excited and depressive episodes at different rates. In particular, it is reported that 
the use of antidepressants may alter the course of the disease for the worse by causing "rapid- 

30 cycling". Also making diagnosis more difficult is the fact that bipolar patients, particularly at 

what is known as Stage III mania, share symptoms of disorganized thinking and behavior with 
bipolar disorder patients. Furthermore, psychiatrists must distinguish between agitated 
depression and mixed mania; it is common that patients with major depression (14 days or 
more) exhibit agitiation, resulting in bipolar-like features. A yet further complicating factor is 

35 that bipolar patients have an exceptionally high rate of substance, particularly alcohol abuse. 
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While the prevalence of mania in alcoholic patients is low, it is well known that substance 
abusers can show excited symptoms. Difficulties therefore result for the diagnosis of bipolar 
patients with substance abuse. 
Treatment 

5 As there are currently no cures for bipolar disorder or schizophrenia, the objective of 

treatment is to reduce the severity of the symptoms, if possible to the point of remission. Due to 
the similarities in symptoms, schizophrenia and bipolar disorder are often treated with some of 
the same medicaments. Both diseases are often treated with antipsychotics and neuroleptics. 
For schizophrenia, for example, antipsychotic medications are the most common and 

10 most valuable treatments. There are four main classes of antipsychotic drugs which are 

commonly prescribed for schizophrenia. The first, neuroleptics, exemplified by chlorpromazine 
(Thorazine), has revolutionized the treatment of schizophrenic patients by reducing positive 
(psychotic) symptoms and preventing their recurrence. Patients receiving chlorpromazine have 
been able to leave mental hospitals and live in community programs or their own homes. But 

15 these drugs are far from ideal. Some 20% to 30% of patients do not respond to them at all, and 

others eventually relapse. These drugs were named neuroleptics because they produce serious 
neurological side effects, including rigidity and tremors in the arms and legs, muscle spasms, 
abnormal body movements, and akath is ia (restless pacing and fidgeting). These side effects are 
so troublesome that many patients simply refuse to take the drugs. Besides, neuroleptics do not 

20 improve the so-called negative symptoms of schizophrenia and the side effects may even 

exacerbate these symptoms. Thus, despite the clear beneficial effects of neuroleptics, even 
some patients who have a good short-term response will ultimately deteriorate in overall 
functioning. 

The well known deficiencies in the standard neuroleptics have stimulated a search for 
25 new treatments and have led to a new class of drugs termed atypical neuroleptics. The first 

atypical neuroleptic, Clozapine, is effective for about one third of patients who do not respond 
to standard neuroleptics. It seems to reduce negative as well as positive symptoms, or at least 
exacerbates negative symptoms less than standard neuroleptics do. Moreover, it has beneficial 
effects on overall functioning and may reduce the chance of suicide in schizophrenic patients. It 
30 does not produce the troubling neurological symptoms of the standard neuroleptics, or raise 

blood levels of the hormone prolactin, excess of which may cause menstrual irregularities and 
infertility in women, impotence or breast enlargement in men. Many patients who cannot 
tolerate standard neuroleptics have been able to take clozapine. However, clozapine has serious 
limitations. It was originally withdrawn from the market because it can cause agranulocytosis, a 
35 potentially lethal inability to produce white blood cells, Agranulocytosis remains a threat that 
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requires careful monitoring and periodic blood tests. Clozapine can also cause seizures and 
other disturbing side effects (e.g., drowsiness, lowered blood pressure, drooling, bed-wetting, 
and weight gain). Thus it is usually taken only by patients who do not respond to other drugs. 

Researchers have developed a third class of antipsychotic drugs that have the virtues of 
5 clozapine without its defects. One of these drugs is risperidone (Risperdal). Early studies 

suggest that it is as effective as standard neuroleptic drugs for positive symptoms and may be 
somewhat more effective for negative symptoms. It produces more neurological side effects 
than clozapine but fewer than standard neuroleptics. However, it raises prolactin levels. 
Risperidone is now prescribed for a broad range of psychotic patients, and many clinicians seem 

10 to use it before clozapine for patients who do not respond to standard drugs, because they regard 

it as safer. Another new drug is Olanzapine (Zyprexa) which is at least as effective as standard 
drugs for positive symptoms and more effective for negative symptoms. It has few neurological 
side effects at ordinary clinical doses, and it does not significantly raise prolactin levels. 
Although it does not produce most of clozapine's most troubling side effects, including 

15 agranulocytosis, some patients taking olanzapine may become sedated or dizzy, develop dry 

mouth, or gain weight. In rare cases, liver function tests become transiently abnormal. 

Outcome studies in schizophrenia are usually based on hospital treatment studies and 
may not be representative of the population of schizophrenia patients. At the extremes of 
outcome, 20 % of patients seem to recover completely after one episode of psychosis, whereas 

20 14-19% of patients develop a chronic unremitting psychosis and never fully recover. In general, 

clinical outcome at five years seems to follow the rule of thirds: with about 35 % of patients in 
the poor outcome category; 36 % in the good outcome category, and the remainder with 
intermediate outcome. Prognosis in schizophrenia does not seem to worsen after five years. 

Whatever the reasons, there is increasing evidence that leaving schizophrenia untreated 

25 for long periods early in course of the illness may negatively affect the outcome. However, the 

use of drugs is often delayed for patients experiencing a first episode of the illness. The patients 
may not realize that they are ill, or they may be afraid to seek help; family members sometimes 
hope the problem will simply disappear or cannot persuade the patient to seek treatment; 
clinicians may hesitate to prescribe antipsychotic medications when the diagnosis is uncertain 

30 because of potential side effects. Indeed, at the first manifestation of the disease, schizophrenia 

is difficult to distinguish from bipolar manic-depressive disorders, severe depression, drug- 
related disorders, and stress-related disorders. Since the optimum treatments differ among these 
diseases, the long term prognosis of the disorder also differs the beginning of the treatment. 
For both schteophrenia and bipolar disorder, all the known molecules used for the 

35 treatment of schizophrenia have side effects and act only against the symptoms of the disease. 
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There is a strong need for new molecules without associated side effects and directed against 
targets which are involved in the causal mechanisms of schizophrenia and bipolar disorder. 
Therefore , tools facilitating the discovery and characterization of these targets are necessary and 
useful. 

5 The aggregation of schizophrenia and bipolar disorder in families, the evidence from 

twin and adoption studies, and the lack of variation in incidence worldwide, indicate that 
schizophrenia and bipolar disorder are primarily genetic conditions, although environmental 
risk factors are also involved at some level as necessary, sufficient, or interactive causes. For 
example, schizophrenia occurs in 1% of the general population. But, if there is one grandparent 

10 with schizophrenia, the risk of getting the illness increases to about 3%; one parent with 

Schizophrenia, to about 10%. When both parents have schizophrenia, the risk rises to 
approximately 40%. 

Consequently, there is a strong need to identify genes involved in schizophrenia and 
bipolar disorder. The knowledge of these genes will allow researchers to understand the 
15 etiology of schizophrenia and bipolar disorder and could lead to drugs and medications which 

are directed against the cause of the diseases, not just against their symptoms. 

There is also a great need for new methods for detecting a susceptibility to 
schizophrenia and bipolar disorder, as well as for preventing or following up the development 
of the disease. Diagnostic tools could also prove extremely useful. Indeed, early identification 
20 of subjects at risk of developing schizophrenia would enable early and/or prophylactic treatment 

< to be administered. Moreover, accurate assessments of the eventual efficacy of a medicament 
as well as the patent's eventual tolerance to it may enable clinicians to enhance the benefit/risk 
ratio of schizophrenia and bipolar disorder treatment regimes. 

25 SUMMARY OF THE INVENTION 

The present invention stems from the identification of novel polymorphisms including 
bialielic markers located on the human chromosome 13q31-q33 locus, the identification and 
characterization of novel schizophrenia-related genes located on the human chromosome 
13q31-q33 locus, and from the identification of genetic associations between alleles of bialielic 

30 markers located on the human chromosome 13q31-q33 locus and disease, as confirmed and 

characterized in a panel of human subjects. The invention furthermore provides a fine structure 
map of the region which includes the schizophrenia-associated gene sequences. 

The present invention pertains to nucleic acid molecules comprising the genomic 
sequences of novel human genes encoding g35030 proteins, proteins encoded thereby, as well 

35 as antibodies thereto. The g3503O genomic sequences may also comprise regulatory sequence 
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located upstream (5 '-end) and downstream (3 '-end) of the transcribed portion of said gene, 
these regulatory sequences being also part of the invention. The invention also deals with the 
cDNA sequences encoding g3503O proteins. 

Oligonucleotide probes or primers hybridizing specifically with a g35030 genomic or 
5 cDNA sequence are also part of the present invention, as well as DNA amplification and 

detection methods using said primers and probes. 

A further object of the invention consists of recombinant vectors comprising any of the 
nucleic acid sequences described above, and in particular of recombinant vectors comprising a 
g35030 regulatory sequence or a sequence encoding a g35030 protein, as well as of cell hosts 
10 and transgenic non human animals comprising said nucleic acid sequences or recombinant 

vectors. 

The invention also concerns to biallelic markers of the g3503O gene and the use thereof. 
Included are probes and primers for use in genotyping biallelic markers of the invention. 

An embodiment of the invention encompasses any polynucleotide of the invention 

15 attached to a solid support polynucleotide may comprise a sequence disclosed in the present 

specification; optionally, said polynucleotide may comprise, consist of, or consist essentially of 
any polynucleotide described in the present specification; optionally, said determining may be 
performed in a hybridization assay, sequencing assay, microsequencing assay, or an enzyme- 
based mismatch detection assay; optionally, said polynucleotide may be attached to a solid 

20 support, array, or addressable array; optionally, said polynucleotide may be labeled. 

Finally, the invention is directed to drug screening assays and methods for the screening 
of substances for the treatment of schizophrenia, bipolar disorder or a related CNS disorder. 
One object of the invention deals with animal models of schizophrenia, including mouse, 
primate, non-human primate bipolar disorder or related CNS disorder. The invention is also 

25 directed to methods for the screening of substances or molecules that modulate the expression 

of g35030, as well as with methods for the screening of substances or molecules that interact 
with a g3503O polypeptide, or that modulate the activity of a g35030 polypeptide. 

As noted above, certain aspects of the present invention stem from the identification of 
genetic associations between schizophrenia and bipolar disorder and alleles of biallelic markers 

30 located on the human chromosome 13q31-q33 region, and more particularly on a subregion 

thereof referred to herein as Region D. The invention provides appropriate tools for 
establishing further genetic associations between alleles of biallelic markers on the 13q3 1- 
13q33 locus and either side effects or benefit resulting from the administration of agents acting 
on schizophrenia or bipolar disorder, or schizophrenia or bipolar disorder symptoms, including 

35 agents like chlorpromazine, clozapine, risperidone, olanzapine, sertindole, quetiapine and 
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ziprasidone. 

The invention provides appropriate tools for establishing further genetic associations 
between alleles of biallelic markers on the 13q31-q33 locus and a trait. Methods and products 
are provided for the molecular detection of a genetic susceptibility in humans to schizophrenia 
5 and bipolar disorder. They can be used for diagnosis, staging, prognosis and monitoring of this 

disease, which processes can be further included within treatment approaches. The invention 
also provides for the efficient design and evaluation of suitable therapeutic solutions including 
individualized strategies for optimizing drug usage, and screening of potential new medicament 
candidates. 

10 Additional embodiments are set forth in the Detailed Description of the Invention and 

in the Examples. 

BRIEF DESCRIPTION OF THE FIGURES 
Figure 1 is a diagram showing the exon structure of the g35030 gene. 
15 Figure 2 is a table demonstrating the statistical significance of allelic frequencies of 

selected chromosome 13q31-q33 biallelic markers of the invention in sporadic and familial 
French Canadian schizophrenia cases and controls. 

Figure 3 is a table demonstrating the results of a haplotype association analysis between 
total French Canadian schizophrenia cases and haplotypes which consist of chromosome 1 3q3 1 - 
20 q33 biallelic markers of the invention. 

Figure 4 is a table showing the involvement of selected biallelic markers of the 
invention in statistically significant haplotypes. 

Figure 5 is a table demonstrating the results of a haplotype association analysis between 
French Canadian schizophrenia cases and haplotypes which consist of chromosome 13q3 1 -q33 
25 biallelic markers of the invention. 

Figure 6 is a table demonstrating the results of a haplotype association analysis between 
French Canadian schizophrenia cases and haplotypes which consist of chromosome 13q3l-q33 
biallelic markers of the invention. 

Figures 7 A and 7B show the results of a haplotype association analysis (Omnibus LR 
30 test value distribution) between schizophrenia cases and haplotypes comprising Region D 

biallelic markers of the invention. 

Figures 8 A and 8B show the results of a haplotype association analysis (HaplotMaxM 
test value distribution) between schizophrenia cases and haplotypes comprising Region D 
biallelic markers of the invention. 
35 Figures 9A and 9B show the results of a haplotype association analysis (Omnibus LR 
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test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figures 1 OA and 10B show the results of a hapforype association analysis (HapIoMaxM 
test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
5 biallelic markers of the invention. 

Figures 1 1 A and 1 IB show the results of a haplotype association analysis (HaploMaxS 
test value distribution) between bipolar disorder cases and haplotypes comprising Region D 
biallelic markers of the invention. 

Figure 1 2 is a block diagram of an exemplary computer system, 
1 0 Figure 13 is a flow diagram illustrating one embodiment of a process 200 for comparing a 

new nucleotide or protein sequence with a database of sequences in order to determine the 
homology levels between the new sequence and the sequences in the database. 

Figure 14 is a flow diagram illustrating one embodiment of a process 250 in a computer 
for determining whether two sequences are homologous. 
1 5 Figure 15 is a flow diagram illustrating one embodiment of an identifier process 300 for 

detecting the presence of a feature in a sequence. 

BRIEF DESCRIPTION OF THE SEQUENCES PROVIDED IN THE 
SEQUENCE LISTING 

20 SEQ ID No. 1 contains the approximately 319kb of genomic nucleotide sequence 

comprising §35030 nucleic acid sequences and the biallelic markers Al to A71 located on the 
human chromosome 1 3q3 1 -q33 locus. 

SEQ ID Nos. 2 to 17 contain cDNA sequences of the g3S030 gene. 
SEQ ID Nos. 1 8 to 23 contain amino acid sequences of g35030 polypeptides, encoded 
25 by cDNAs of the invention. 

SEQ ID Nos. 24 to 78 respectively contain the nucleotide sequence of the amplicons 
which comprise the biallelic markers A72 to A 1 27 located on the human chromosome 13q3 1- 
q33 locus. 

SEQ ID No. 79 to 132 contain genomic nucleotide sequences comprising exons of the 
30 g35030 gene from several different primates. 

SEQ ID No 133 contains a primer containing the additional PU 5' sequence described 
further in Example 2. 

SEQ ID No 134 contains a primer containing the additional RP 5 J sequence described 
further in Example 2. 



35 
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In accordance with the regulations relating to Sequence Listings, the following codes 
have been used in the Sequence Listing to indicate the locations of biallelic markers within the 
sequences and to identify each of the alleles present at the polymorphic base. The code "r" in 
the sequences indicates that one allele of the polymorphic base is a guanine, while the other 
5 allele is an adenine. The code "y" in the sequences indicates that one allele of the polymorphic 

base is a thymine, while the other allele is a cytosine. The code "tn" in the sequences indicates 
that one allele of the polymorphic base is an adenine, while the other allele is an cytosine. The 
code "k" in the sequences indicates that one allele of the polymorphic base is a guanine, while 
the other allele is a thymine. The code V in the sequences indicates that one allele of the 
10 polymorphic base is a guanine, while the other allele is a cytosine. The code "w" in the 

sequences indicates that one allele of the polymorphic base is an adenine, while the other allele 
is an thymine. 

DETAILED DESCRIPTION OF THE INVENTION 

The identification of genes involved in a particular trait such as a specific central 

15 nervous system disorder, like schizophrenia, can be carried out through two main strategies 

currently used for genetic mapping: linkage analysis and association studies. Linkage analysis 
requires the study of families with multiple affected individuals and is now useful in the 
detection of mono- or oligogenic inherited traits. Conversely, association studies examine the 
frequency of marker alleles in unrelated trait (T+) individuals compared with trait negative (T-) 

20 controls, and are generally employed in the detection of polygenic inheritance. 

Candidate region on the chromosome 13 (linkage analysis) 
Genetic link or "linkage" is based on an analysis of which of two neighboring 
sequences on a chromosome contains the least recombinations by crossing-over during meiosis. 
To do this, chromosomal markers, like microsatellite markers, have been localized with 

25 precision on the genome. Genetic link analysis calculates the probabilities of recombinations on 

the target gene with the chromosomal markers used, according to the genealogical tree, the 
transmission of the disease, and the transmission of the markers. Thus, if a particular allele of a 
given marker is transmitted with the disease more often than chance would have it 
(recombination level between 0 and 0.5), it is possible to deduce that the target gene in question 

30 is found in the neighborhood of the marker. 

Using this technique, it has been possible to localize several genes demonstrating a 
genetic predisposition of familial cancers. In order to be able to be included in a genetic link 
study, the families affected by a hereditary form of the disease must satisfy the 
"inforrnativeness" criteria: several affected subjects (and whose constitutional DNA is 

35 available) per generation, and at best having a large number of siblings. 
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By linkage analysis, observations have been made, according to which a candidate 
region for schizophrenia is present on chromosome 13q32 locus (Blouin et ah, 1998). Linkage 
analysis has been successfully applied to map simple genetic traits that show clear Mendelian 
inheritance patterns and which have a high penetrance, but this method suffers from a variety of 
5 drawbacks. First, linkage analysis is limited by its reliance on the choice of a genetic model 

suitable for each studied trait. Furthermore, the resolution attainable using linkage analysis is 
limited, and complementary studies are required to refine the analysis of the typical 20 Mb 
regions initially identified through this method. In addition, linkage analysis have proven 
difficult when applied to complex genetic traits, such as those due to the combined action of 

10 multiple genes and/or environmental factors. Tn such cases, too great an effort and cost are 

needed to recruit the adequate number of affected families required for applying linkage 
analysis to these situations. Finally, linkage analysis cannot be applied to the study of traits for 
which no large informative families are available. 

In the present invention alternative means for conducting association studies rather than 

1 5 linkage analysis between markers located on the chromosome 1 3q3 1 -q33 locus and a trait, 

preferably schizophrenia or bipolar disorder, are disclosed. 

In the present application, additional biallelic markers located on the human 
chromosome 13q31-q33 locus associated with schizophrenia are disclosed. The identification 
of these biallelic markers in association with schizophrenia has allowed for the further 

20 definition of the chromosomal region suspected of containing a genetic determinant involved in 

a predisposition to develop schizophrenia and has resulted in the identification of novel gene 
sequences disclosed herein which are associated with a predisposition to develop schizophrenia. 
The present invention thus provides an extensive fine structure map of the 13q31-q33 locus, 
including novel biallelic markers located on the human 1 3q3 1 -q33 locus, approximately 3 1 9kb 

25 of genomic nucleotide sequence of a subregion of the human 13q3 l-q33 locus, and 

polymorphisms including biallelic markers and nucleotide deletions in said 319kb genomic 
sequence. The biallelic markers of the human chromosome 13q3 l-q33 locus and the nucleotide 
sequences, polymorphisms and gene sequences located in Region D subregion of the human 
chromosome 13q3 l-q33 locus are useful as genetic and physical markers for further mapping 

30 studies. These identified polymorphisms are used in the design of assays for the reliable 

detection of genetic susceptibility to schizophrenia and bipolar disorder. They can also be used 
in the design of drug screening protocols to provide an accurate and efficient evaluation of the 
therapeutic and side-effect potential of new or already existing medicament or treatment regime. 
Using the biallelic markers, the inventors have identified the g35030 candidate 

35 schizophrenia gene, further described herein. The sequences comprising the the schizophrenia- 
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associated genes are useful, for example, for the isolation of other genes in putative gene 
families, the identification of homologs from other species, for the development of medicaments 
for treatment of disease, and as probes and primers for diagnostic or screening assays as 
described herein." 

5 

Definitions 

As used interchangeably herein, the term " oligonucleotides ", and " polynucleotides " 
include RNA, DNA, or RNA/DNA hybrid sequences of more than one nucleotide in either 
single chain or duplex form. The term " nucleotide " as used herein as an adjective to describe 

10 molecules comprising RNA, DNA, or RNA/DNA hybrid sequences of any length in single- 

stranded or duplex form. The term "nucleotide" is also used herein as a noun to refer to 
individual nucleotides or varieties of nucleotides, meaning a molecule, or individual unit in a 
larger nucleic acid molecule, comprising a purine or pyrimidine, a ribose or deoxyribose sugar 
moiety, and a phosphate group, or phosphodiester linkage in the case of nucleotides within an 

1 5 oligonucleotide or polynucleotide. Although the term "nucleotide" is also used herein to 

encompass "modified nucleotides" which comprise at least one modifications (a) an alternative 
linking group, (b) an analogous form of purine, (c) an analogous form of pyrimidine, or (d) an 
analogous sugar, for examples of analogous linking groups, purine, pyrimidines, and sugars see 
for example PCT publication No. WO 95/04064, However, the polynucleotides of the invention 

20 are preferably comprised of greater than 50% conventional deoxyribose nucleotides, and most 

preferably greater than 90% conventional deoxyribose nucleotides. The polynucleotide 
sequences of the invention may be prepared by any known method, including synthetic, 
recombinant, ex vivo generation, or a combination thereof, as well as utilizing any purification 
methods known in the art. 

25 As used herein, the term "purified" does not require absolute purity; rather, it is 

intended as a relative definition. Purification of starting material or natural material is at least 
one order of magnitude, preferably two or three orders, and more preferably four or five orders 
of magnitude is expressly contemplated. As an example, purification from 0.1 % concentration 
to 10 % concentration is two orders of magnitude. 

30 To illustrate, individual cDNA clones isolated from a cDNA library have been 

conventionally purified to electrophoretic homogeneity. The sequences obtained from these 
clones could not be obtained directly either from the library or from total human DNA. The 
cDNA clones are not naturally occurring as such, but rather are obtained via manipulation of a 
partially purified naturally occurring substance (messenger RNA). The conversion of mRNA 

35 into a cDNA library involves the creation of a synthetic substance (cDNA) and pure individual 
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cDNA clones can be isolated from the synthetic library by clonal selection. Thus, creating a 
cDNA library from messenger RNA and subsequently isolating individual clones from that 
library results in an approximately 10 4 -10 6 fold purification of the native message. 

The term "purified** is further used herein to describe a polypeptide or polynucleotide of 
5 the invention which has been separated from other compounds including, but not limited to, 

polypeptides or polynucleotides, carbohydrates, lipids, etc. The term "purified" may be used to 
specify the separation of monomeric polypeptides of the invention from oligomeric forms such 
as homo- or hetero- dimers, trimers, etc. The term "purfied" may also be used to specify the 
separation of covalently closed polynucleotides from linear polynucleotides. A polynucleotide 

1 0 is substantially pure when at least about 50%, preferably 60 to 75% of a sample exhibits a single 

polynucleotide sequence and conformation (linear versus covalently close). A substantially 
pure polypeptide or polynucleotide typically comprises about 50%, preferably 60 to 90% 
weight/weight of a polypeptide or polynucleotide sample, respectively, more usually about 
95%, and preferably is over about 99% pure. Polypeptide and polynucleotide purity, or 

1 5 homogeneity, is indicated by a number of means well known in the art, such as agarose or 

polyacrylamide gel electrophoresis of a sample, followed by visualizing a single band upon 
staining the gel. For certain purposes higher resolution can be provided by using HPLC or other 
means well known in the art. As an alternative embodiment, purification of the polypeptides and 
polynucleotides of the present invention may be expressed as "at least" a percent purity relative to 

20 heterologous polypeptides and polynucleotides (DNA, RNA or both). As a preferred embodiment, 

the polypeptides and polynucleotides of the present invention are at least; 10%, 20%, 30%, 40%, 
50%, 60%, 70%, 80%, 90%, 95%, 96%, 96%, 98%, 99%, or 100% pure relative to heterologous 
polypeptides and polynucleotides, respectively. As a further preferred embodiment the 
polypeptides and polynucleotides have a purity ranging from any number, to the thousandth 

25 position, between 90% and 100% (e.g., a polypeptide or polynucleotide at least 99.995% pure) 

relative to either heterologous polypeptides or polynucleotides, respectively, or as a weight/weight 
ratio relative to all compounds and molecules other than those existing in the carrier. Each 
number representing a percent purity, to the thousandth position, may be claimed as individual 
species of purity. 

30 The term " isolated ** requires that the material be removed from its original environment 

(e.g., the natural environment if it is naturally occurring). For example, a naturally-occurring 
polynucleotide or polypeptide present in a living animal is not isolated, but the same 
polynucleotide or DNA or polypeptide, separated from some or all of the coexisting materials in 
the natural system, is isolated. Such polynucleotide could be part of a vector and/or such 

35 polynucleotide or polypeptide could be part of a composition, and still be isolated in that the 
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vector or composition is not part of its natural environment 

The term " primer " denotes a specific oligonucleotide sequence which is complementary 
to a target nucleotide sequence and used to hybridize to the target nucleotide sequence. A 
primer serves as an initiation point for nucleotide polymerization catalyzed by either DNA 
5 polymerase, RNA polymerase or reverse transcriptase. 

The term " probe " denotes a defined nucleic acid segment (or nucleotide analog 
segment, e.g., polynucleotide as defined herein) which can be used to identify a specific 
polynucleotide sequence present in samples, said nucleic acid segment comprising a nucleotide 
sequence complementary of the specific polynucleotide sequence to be identified. 
10 The terms " trait" and " phenotvpe " are used interchangeably herein and refer to any 

clinically distinguishable, detectable or otherwise measurable property of an organism such as 
symptoms of, or susceptibility to a disease for example, Typically the terms "trait" or 
"phenotype" are used herein to refer to symptoms of, or susceptibility to schizophrenia or 
bipolar disorder; or to refer to an individual's response to an agent acting on schizophrenia or 
1 5 bipolar disorder; or to refer to symptoms of, or susceptibility to side effects to an agent acting 

on schizophrenia or bipolar disorder. 

The term "allele" is used herein to refer to variants of a nucleotide sequence. A 
biallelic polymorphism has two forms. Typically the first identified allele is designated as the 
original allele whereas other alleles are designated as alternative alleles. Diploid organisms 
20 may be homozygous or heterozygous for an allelic form. 

The term " heterozygosity rate " is used herein to refer to the incidence of individuals in 
a population, which are heterozygous at a particular allele. In a biallelic system the 
heterozygosity rate is on average equal to 2P a (l-P a )> where P a is the frequency of the least 
common allele. In order to be useful in genetic studies a genetic marker should have an 
25 adequate level of heterozygosity to allow a reasonable probability that a randomly selected 

person will be heterozygous. 

The term " genotype " as used herein refers the identity of the alleles present in an 
individual or a sample. In the context of the present invention a genotype preferably refers to 
the description of the biallelic marker alleles present in an individual or a sample. The term 
30 " genotvping " a sample or an individual for a biallelic marker involves determining the specific 

allele or the specific nucleotide(s) carried by an individual at a biallelic marker. 

The term "mutation " as used herein refers to a difference in DNA sequence between or 
among different genomes or individuals which has a frequency below 1%. 

The term " haplotvp e" refers to a combination of alleles present in an individual or a 
35 sample on a single chromosome. In the context of the present invention a haplotype preferably 
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refers to a combination of biallelic marker alleles found in a given individual and which may be 
associated with a phenotype. 

The term " polymorphism " as used herein refers to the occurrence of two or more 
alternative genomic sequences or alleles between or among different genomes or individuals. 

5 "Polymorphic" refers to the condition in which two or more variants of a specific genomic 

sequence can be found in a population. A " polymorphic site " is the locus at which the variation 
occurs. A polymorphism may comprise a substitution, deletion or insertion of one or more 
nucleotides. A single nucleotide polymorphism is a single base pair change. Typically a single 
nucleotide polymorphism is the replacement of one nucleotide by another nucleotide at the 

10 polymorphic site. Deletion of a single nucleotide or insertion of a single nucleotide, also give 

rise to single nucleotide polymorphisms. In the context of the present invention "single 
nucleotide polymorphism" preferably refers to a single nucleotide substitution. Typically, 
between different genomes or between different individuals, the polymorphic site may be 
occupied by two different nucleotides . 

15 The terms " biallelic polymorphism " and "biallelic marker " are used interchangeably 

herein to refer to a polymorphism having two alleles at a fairly high frequency in the 
population, preferably a single nucleotide polymorphism. A " biallelic marker allele " refers to 
the nucleotide variants present at a biallelic marker site. Typically the frequency of the less 
common allele of the biallelic markers of the present invention has been validated to be greater 

20 than 1%, preferably the frequency is greater than 10%, more preferably the frequency is at least 

20% (i.e. hetero2ygosity rate of at least 0.32), even more preferably the frequency is at least 
30% (i.e. heterozygosity rate of at least 0.42). A biallelic marker wherein the frequency of the 
less common allele is 30% or more is termed a " high quality biallelic marker ." All of the 
genotyping, haplotyping, association, and interaction study methods of the invention may 

25 optionally be performed solely with high quality biallelic markers. 

The location of nucleotides in a polynucleotide with respect to the center of the 
polynucleotide are described herein in the following manner. When a polynucleotide has an 
odd number of nucleotides, the nucleotide at an equal distance from the 3* and 5 f ends of the 
polynucleotide is considered to be "at the center " of the polynucleotide, and any nucleotide 

30 immediately adjacent to the nucleotide at the center, or the nucleotide at the center itself is 

considered to be "within 1 nucleotide of the center ." With an odd number of nucleotides in a 
polynucleotide any of the Five nucleotides positions in the middle of the polynucleotide would 
be considered to be within 2 nucleotides of the center, and so on. When a polynucleotide has an 
even number of nucleotides, there would be a bond and not a nucleotide at the center of the 

35 polynucleotide. Thus, either of the two central nucleotides would be considered to be "within 1 
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nucleotide of the center" and any of the four nucleotides in the middle of the polynucleotide 
would be considered to be "within 2 nucleotides of the center", and so on. For polymorphisms 
which involve the substitution, insertion or deletion of 1 or more nucleotides, the 
polymorphism, allele or biallelic marker is "at the center" of a polynucleotide if the difference 
5 between the distance from the substituted, inserted, or deleted polynucleotides of the 

polymorphism and the 3' end of the polynucleotide, and the distance from the substituted, 
inserted, or deleted polynucleotides of the polymorphism and the 5' end of the polynucleotide is 
zero or one nucleotide. If this difference is 0 to 3, then the polymorphism is considered to be 
"within 1 nucleotide of the center." If the difference is 0 to 5, the polymorphism is considered 

10 to be "within 2 nucleotides of the center." If the difference is 0 to 7, the polymorphism is 

considered to be "within 3 nucleotides of the center," and so on. For polymorphisms which 
involve the substitution, insertion or deletion of 1 or more nucleotides, the polymorphism, allele 
or biallelic marker is "at the center" of a polynucleotide if the difference between the distance 
from the substituted, inserted, or deleted polynucleotides of the polymorphism and the 3* end of 

15 the polynucleotide, and the distance from the substituted, inserted, or deleted polynucleotides of 

the polymorphism and the 5* end of the polynucleotide is zero or one nucleotide. If this 
difference is 0 to 3, then the polymorphism is considered to be "within 1 nucleotide of the 
center." If the difference is 0 to 5, the polymorphism is considered to be "within 2 nucleotides 
of the center." If the difference is 0 to 7, the polymorphism is considered to be "within 3 

20 nucleotides of the center," and so on. 

The term " upstream " is used herein to refer to a location which, is toward the 5* end of 
the polynucleotide from a specific reference point. 

The terms " base paired " and " Watson & Crick base paired " are used interchangeably 
herein to refer to nucleotides which can be hydrogen bonded to one another be virtue of their 

25 sequence identities in a manner like that found in double-helical DNA with thymine or uracil 

residues linked to adenine residues by two hydrogen bonds and cytosine and guanine residues 
linked by three hydrogen bonds (See Stryer, L., Biochemistry, 4th edition, 1995). 

The terms "complementary " or " complement thereof are used herein to refer to the 
sequences of polynucleotides which is capable of forming Watson & Crick base pairing with 

30 another specified polynucleotide throughout the entirety of the complementary region. This 

term is applied to pairs of polynucleotides based solely upon their sequences and not any 
particular set of conditions under which the two polynucleotides would actually bind. 

The terms " g35O30 gene ", when used herein, encompasses genomic, mRNA and 
cDNA sequences encoding a g35030 protein, including the untranslated regulatory regions of 

35 the genomic DNA. 
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As used herein the term " 13q31-q33-related biallelic marker " relates to a bialielic 
marker residing in the human chromosome 1 3q31 -q33 region. The term 13q3 1 -q33-related 
biallelic marker encompasses all of the biallelic markers disclosed in Table 5 and any biallelic 
markers in linkage disequilibrium therewith, and any biallelic markers in linkage disequilibrium 
5 therewith. The preferred chromosome 13q3 l-q33-related biallelic marker alleles of the present 

invention include each one the alleles described in Table 5 individually or in groups consisting 
of all the possible combinations of the alleles listed. 

As used herein the term " g35Q3Q-related biallelic marker" relates to a set of biallelic 
markers in linkage disequilibrium with the g3503O gene or a g35030 nucleotide sequence. The 
10 term g35030-related biallelic marker encompasses the biallelic markers A13 to A65 disclosed in 

Table 5 and any biallelic markers in linkage disequilibrium therewith. 

The term " polypeptide " refers to a polymer of amino acids without regard to the length 
of the polymer; thus, peptides, oligopeptides, and proteins are included within the definition of 
polypeptide. This term also does not specify or exclude prost-expression modifications of 
15 polypeptides, for example, polypeptides which include the covalent attachment of glycosyl 

groups, acetyl groups, phosphate groups, lipid groups and the like are expressly encompassed 
by the term polypeptide. Also included within the definition are polypeptides which contain 
one or more analogs of an amino acid (including, for example, non-naturally occurring amino 
acids, amino acids which only occur naturally in an unrelated biological system, modified 
20 amino acids from mammalian systems etc.), polypeptides with substituted linkages, as well as 

other modifications known in the art, both naturally occurring and non-naturally occurring. 

As used herein, the term " non-human animal " refers to any non-human vertebrate, birds 
and more usually mammals, preferably primates, farm animals such as swine, goats, sheep, 
donkeys, and horses, rabbits or rodents, more preferably rats or mice. As used herein, the term 
25 " animal " is used to refer to any vertebrate, preferable a mammal. Both the terms " animal " and 

■" mammal " expressly embrace human subjects unless preceded with the term "non-human". 

As used herein, the term " antibody " refers to a polypeptide or group of polypeptides 
which are comprised of at least one binding domain, where an antibody binding domain is 
formed from the folding of variable domains of an antibody molecule to form three-dimensional 
30 binding spaces with an internal surface shape and charge distribution complementary to the 

features of an antigenic determinant of an antigen., which allows an immunological reaction 
with the antigen. Antibodies include recombinant proteins comprising the binding domains, as 
wells as fragments, including Fab, Fab', F(ab)2, and F(ab')2 fragments. 

As used herein, an " antigenic determinant " is the portion of an antigen molecule, in this 
35 case a g35030 polypeptide, that determines the specificity of the antigen-antibody reaction. An 
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"epitope" refers to an antigenic determinant of a polypeptide. An epitope can comprise as few 
as 3 amino acids in a spatial conformation which is unique to the epitope. Generally an epitope 
comprises at least 6 such amino acids, and more usually at least 8-10 such amino acids. 
Methods for determining the amino acids which make up an epitope include x-ray 
5 crystallography, 2-dimensional nuclear magnetic resonance, and epitope mapping e.g. the 

Pepscan method described by Geysen et at. 1984; PCT Publication No. WO 84/03564; and 
PCT Publication No. WO 84/03506. 



Variants and Fragments 

10 The invention also relates to variants and fragments of the polynucleotides described 

herein, particularly of a nucleotide sequence of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 
132, and particularly of a nucleotide sequence of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 
1 32 containing one or more biallelic markers and/or other polymorphisms according to the 
invention. 

15 Variants of polynucleotides, as the term is used herein, are polynucleotides that differ 

from a reference polynucleotide. A variant of a polynucleotide may be a naturally occurring 
variant such as a naturally occurring allelic variant, or it may be a variant that is not known to 
occur naturally. Such non-naturally occurring variants of the polynucleotide may be made by 
mutagenesis techniques, including those applied to polynucleotides, cells or organisms. 

20 Generally, differences are limited so that the nucleotide sequences of the reference and the 

variant are closely similar overall and, in many regions, identical. 

Variants of polynucleotides according to the invention include, without being limited ' 
to, nucleotide sequences which are at least 95% identical to a polynucleotide selected from the 
group consisting of the nucleotide sequences SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 

25 or to any polynucleotide fragment of at least 8 consecutive nucleotides of a polynucleotide 

selected from the group consisting of the nucleotide SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 
to 132, and preferably at least 99% identical, more particularly at least 99,5% identical, and 
most preferably at least 99.8% identical to a polynucleotide selected from the group consisting 
of the nucleotide SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or to any polynucleotide 

30 fragment of at least 30, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000, to the extent that the 

length is consistent with the particular sequence ID, consecutive nucleotides of a polynucleotide 
selected from the group consisting of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 
28, 74 and 79 to 132. 

Nucleotide changes present in a variant polynucleotide may be silent, which means that 
35 they do not alter the amino acids encoded by the polynucleotide. However, nucleotide changes 
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may also result in amino acid substitutions, additions, deletions, fusions and truncations in the 
polypeptide encoded by the reference sequence. The substitutions, deletions or additions may 
inyolve one or more nucleotides. The variants may be altered in coding or non-coding regions 
or both. Alterations in the coding regions may produce conservative or non-conservative amino 
5 acid substitutions, deletions or additions. 

A polynucleotide fragment is a polynucleotide having a sequence that is entirely the 
same as part but not all of a given nucleotide sequence, preferably the nucleotide sequence of a 
g3503O polynucleotide, and variants thereof, or of a polynucleotide of any of SEQ ID Nos 1 to 
17, 24, 25, 28, 74 and 79 to 132, or a polynucleotide comprising one of the biallelic markers Al 

10 to A127, or the complements thereof. Such fragments may be "free-standing", i.e. not part of or 

fused to other polynucleotides, or they may be comprised within a single larger polynucleotide 
of which they form a part or region. Indeed, several of these fragments may be present within a 
single larger polynucleotide. Optionally, such fragments may comprise, consist of, or consist 
essentially of a contiguous span of at least 8, 10, 12, 15, 18, 20, 25, 30, 35, 40, 50, 70„ 80, 100, 

15 250, 500 , 1000 or 2000 nucleotides in length of any of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 

79 to 132. 

Identity Between Nucleic Acids Or Polypeptides 

The terms " percentage of sequence identity " and " percentage homology" are used 
interchangeably herein to refer to comparisons among polynucleotides and polypeptides, and 

20 are determined by comparing two optimally aligned sequences over a comparison window, 

wherein the portion of the polynucleotide or polypeptide sequence in the comparison window 
may comprise additions or deletions (i.e., gaps) as compared to the reference sequence (which 
does not comprise additions or deletions) for optimal alignment of the two sequences. The 
percentage is calculated by determining the number of positions at which the identical nucleic 

25 acid base or amino acid residue occurs in both sequences to yield the number of matched 

positions, dividing the number of matched positions by the total number of positions in the 
window of comparison and multiplying the result by 100 to yield the percentage of sequence 
identity. Homology is evaluated using any of the variety of sequence comparison algorithms 
and programs known in the art. Such algorithms and programs include, but are by no means 

30 limited to, TBLASTN, BLASTP, FASTA, TFASTA, and CLUSTALW (Pearson and Lipman, 

1 988, Proc. Natl. Acad. Sci. USA 85(8):2444-2448; Altschul et al, 1990, J. Mol. Biol. 
215(3):403-410; Thompson et al., 1994, Nucleic Acids Res. 22(2):4673-4680; Higgins et al., 
1996, Methods Enzymol, 266:383-402; Altschul et al., 1990, J t Mol. Biol. 215(3):403-41 0; 
Altschul et al., 1993, Nature Genetics 3:266-272). In a particularly preferred embodiment, 

35 protein and nucleic acid sequence homologies are evaluated using the Basic Local Alignment 
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Search Tool ("BLAST") which is well known in the art (see, e.g., Karlin and Altschul, 1 990, 
Proc. Natl. Acad. Sci. USA 87:2267-2268; Altschul etal., 1990, J. Mol. Biol. 215:403-410; 
Altschul et ah, 1993, Nature Genetics 3:266-272; Altschul et al., 1997, Nuc. Acids Res. 
25:3389-3402).' In particular, five specific BLAST programs are used to perform the following 
5 task: 

(1) BLASTP and BLAST3 compare an amino acid query sequence against a protein 
sequence database; 

(2) BLASTN compares a nucleotide query sequence against a nucleotide sequence 
database; 

10 (3) BLASTX compares the six-frame conceptual translation products of a query 

nucleotide sequence (both strands) against a protein sequence database; 

(4) TBLASTN compares a query protein sequence against a nucleotide sequence 
database translated in all six reading frames (both strands); and 

(5) TBLASTX compares the six-frame translations of a nucleotide query sequence 
1 5 against the six-frame translations of a nucleotide sequence database. 

The BLAST programs identify homologous sequences by identifying similar segments, 
which are referred to herein as "high-scoring segment pairs," between a query amino or nucleic 
acid sequence and a test sequence which is preferably obtained from a protein or nucleic acid 
sequence database. High-scoring segment pairs are, preferably identified (i.e., aligned) by 

20 means of a scoring matrix, many of which are known in the art. Preferably, the scoring matrix 

used is the BLOSUM62 matrix (Gonnet et al., 1992, Science 256:1443-1445; Henikoff and 
Henikoff, 1993, Proteins 17:49-61). Less preferably, the PAM or PAM250 matrices may also 
be used (see, e.g., Schwartz and Dayhoff, eds., 1978, Matrices for Detecting Distance 
Relationships: Atlas of Protein Sequence and Structure, Washington: National Biomedical 

25 Research Foundation). The BLAST programs evaluate the statistical significance of all high- 

scoring segment pairs identified, and preferably selects those segments which satisfy a user- 
specified threshold of significance, such as a user-specified percent homology. Preferably, the 
statistical significance of a high-scoring segment pair is evaluated using the statistical 
significance formula of Karlin (see, e.g., Karlin and Altschul, 1990, Proc. Natl. Acad. Sci. USA 

30 87:2267-2268). 

The BLAST programs may be used with the default parameters or with modified 
parameters provided by the user. 



Stringent Hybridization Conditions 

By way of example and not limitation, procedures using conditions of high stringency 
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are as follows: prehybridization of filters containing DNA is carried out for 8 h to overnight at 
65°C in buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), I mM EDTA, 0.02% PVP, 
0.02% Ficoll, 0.02% BSA, and 500 jig/ml denatured salmon sperm DNA. Filters are hybridized 
for 48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture 
5 containing 100 u.g/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-labeled probe. 

Subsequently, filter washes can be done at 37°C for I h in a solution containing 2 x SSC, 0,01% 
PVP, 0.01% Ficoll, and 0.01% BSA, followed by a wash in 0,1 X SSC at 50°C for 45 min. 
Following the wash steps, the hybridized probes are detectable by autoradiography. Other 
conditions of high stringency which may be used are well -known in the art and as cited in 

10 Sambrook et ah, 1 989; and Ausubel et al., 1989. These hybridization conditions are suitable 

for a nucleic acid molecule of about 20 nucleotides in length. There is no need to say that the 
hybridization conditions described above are to be adapted according to the length of the 
desired nucleic acid, following techniques well known to the one skilled in the art. The suitable 
hybridization conditions may for example be adapted according to the teachings disclosed in the 

15 book of Hames and Higgins (1985)or in Sambrook et al.(1989). 

Genomic Sequences of g3 5030 polynucleotides 

The present invention concerns genomic DNA sequences of the g35030 gene. As 
referred to herein, genomic sequences of g35030 are indicated by nucleotide position in the 5 f to 

20 3* orientation on SEQ ID No I. 

Particularly preferred nucleic acids of the invention include isolated, purified, or 
recombinant polynucleotides comprising, consisting essentially of, or consisting of a contiguous 
span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, or 1000 
nucleotides of nucleotide positions 20 1 1 23 to 247802 of SEQ ID No 1, or the complements 

25 thereof. Also encompassed are purified, isolated, or recombinant polynucleotides comprising a 

nucleotide sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with 
nucleotide positions 201 123 to 247802 of SEQ ID No l s or a complementary sequence thereto 
or a fragment thereof. Also encompassed are purified, isolated, or recombinant nucleic acids 
that hybridize with said span under stringent hybridization conditions as defined above. Nucleic 

30 acids of the invention encompass g35030 nucleic acids from any source, including primate, 

non-human primate, mammalian and human. 

The human g35030 gene comprises exons selected from at least 14 different exons or 
exon forms, referred to herein as exons S, S2, T, U, V, VI, V2, V3, V4, W, W2, X, Y, Z. As 
indicated in Table 1, exons V and V2 are considered alternative exons, containing overlapping 

35 sequence. Likewise, exon W2 contains overlapping sequence with exons W and X. 
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The nucleotide positions of g35030 exons in SEQ ID No. 1 are detailed below in Table 
1. The exon structure of the g35030 gene is further shown in Figure 1 . 

Table 1 



Exon 


Position in SEQID1 


Intron Position in SEQIDl 


S 


201123 


201234 


201235 


201122 


S2 


201123 


201560 


201561 


214675 


T 


214676 


214793 


214794 


215701 


U 


215702 


215746 


215747 


216835 


V 


216836 


216994 


216995 


217670 


V2 


216836 


217077 


217078 


217670 


VI 


217671 


217764 


217765 


227654 


V4 


227655 


227736 


227737 


238714 


, V3 


238715 


238919 . 


238920 


240439 


W 


240440 


240673 


240674 


244352 or 24 1071 


W2 


240440 


24U53 


241154 


244352 


X 


241072 


241291 


241292 


244352 


Y 


244353 


244561 


244562 


246272 


Z 


246273 


247802 







5 Thus, the invention embodies purified, isolated, or recombinant polynucleotides 

comprising a nucleotide sequence selected from the group consisting of the exons of the g35030 
gene, or a sequence complementary thereto. Preferred are purified, isolated, or recombinant 
polynucleotides comprising at least one exon having the nucleotide position ranges listed in 
Table 1 selected from the group consisting of the exons S, S2, T, U, V, VI, V2, V3, V4, W, 

10 W2, X, Y, Z of the g35030 gene, or a complementary sequence thereto or a fragment or a 

variant thereof. Also encompassed by the invention are purified, isolated, or recombinant 
nucleic acids comprising a combination of at least two exons of the g35030 gene selected from 
the group consisting of exons S, S2, T, U, V, VI , V2, V3, V4, W, W2, X, Y, Z, wherein the 
polynucleotides are arranged within the nucleic acid in the same relative order as in SEQ ED No. 

15 1. 1 

Particularly preferred nucleic acids of the invention include isolated, purified, or 
recombinant polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 
40, 45, 50, 55, 60, 65, 70, 75, 80, 90, 100 or 200 nucleotides of SEQ ID No 1, wherein said 
contiguous span comprises at least 1 , 2, 3, 5, or 10 of the following nucleotide positions of SEQ 

20 ID No 1: 201123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 
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to 216994, 216836 to 2 17077, 217671 to 217764, 227655 to 227736, 23871 5 to 238919, 
240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 
247802, or the complements thereof. 

Another object of the invention consists of a purified, isolated, or recombinant nucleic 
5 acid that hybridizes with a g35030 nucleotide sequence of nucleotide positions 201 123 to 

201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 23871 5 to 238919, 240440 to 240673, 
240440 to 241153,241072 to 241291, 244353 to 244561 and 246273 to 247802 of SEQ ID No 
1, or a complementary sequence thereto or a variant thereof, under stringent hybridization 

1 0 conditions as defined above. 

The invention also encompasses a purified, isolated, or recombinant polynucleotide 
comprising a nucleotide sequence of g35030 having at least 70, 75, 80, 85, 90, or 95% 
nucleotide identity with a sequence selected from the group consisting of nucleotide positions 
201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 

15 216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919,240440 

to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 247802 of 
SEQ ID No. 1 or a complementary sequence thereto or a fragment thereof. The nucleotide 
differences as regards the nucleotide positions 201 123 to 201234, 201 123 to 201560, 21 4676 to 
214793, 215702 to 215746, 216836 to 216994, 216836 to 217077, 217671 to 217764, 227655 

20 to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241153, 241072 to 241291, 

244353 to 244561 and 246273 to 247802 of SEQ ID No. 1 may generally be distributed 
throughout the nucleic acid. 

The present invention further embodies purified, isolated, or recombinant 
polynucleotides comprising a nucleotide sequence selected from the group consisting of the 

25 introns of the g35030 gene, or a sequence complementary thereto. The present invention thus 

encompasses the g35030 gene as well as g35O30 genomic sequences consisting of, consisting 
essentially of, or comprising the sequence of nucleotide positions 20 1 235 to 20 1 1 22, 20 1 56 1 to 
214675, 214794 to 215701, 215747 to 216835, 216995 to 217670, 217078 to 217670, 217765 
to 227654, 227737 to 238714, 238920 to 240439, 240674 to 244352, 240674 to 241071, 

30 241 154 to 244352, 241292 to 244352 and 244562 to 246272 of SEQ ID No 1, a sequence 

complementary thereto, as well as fragments and variants thereof. 

These nucleic acids, as well as their fragments and variants, may be used as 
oligonucleotide primers or probes in order to detect the presence of a copy of a gene comprising 
a g35030 nucleic acid sequence in a test sample, or alternatively in order to amplify a target 

35 nucleotide sequence within a g35030 nucleic acid sequence or adjoining region. 
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Additional preferred nucleic acids of the invention include isolated, purified, or 
recombinant g35030 polynucleotides comprising a contiguous span of at least 12, 1 5, 1 8, 20, 
25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 90, 100 or 200 nucleotides of nucleotide positions 
201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
5 216994, 216836 to 217077, 217671 to 21 7764, 227655 to 227736, 238715 to 238919, 240440 

to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 
201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No 
1, or the complements thereof, wherein said contiguous span comprises at least one biallelic 
marker. Optionally, said contiguous span comprises a g3503O-related biallelic marker. It should 
10 be noted that nucleic acid fragments of any size and sequence may also be comprised by the 

polynucleotides described in this section. Either the original or the alternative allele may be 
present at said biallelic marker. 

As noted above, g35O30 gene sequence are indicated by position on SEQ ID Nol . 
Accordingly, in certain embodiments, a g3503O nucleotide sequence may also comprise 
15 isolated, purified, or recombinant polynucleotides comprising a contiguous span of at least 12, 

15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 2000 nucleotides of 
nucleotide positions 3 1 to 3 19608 of SEQ ID No. 1 , or the complements thereof. Optionally, 
said contiguous span comprises a biallelic marker selected from the group consisting of A13 to 
Al 8, A20 to A47, A50 to A53, A56, A58, and A60 to A64. Optionally allele 2 is present at the 
20 biallelic marker. It should be noted that nucleic acid fragments of any size and sequence may 

be comprised by the polynucleotides described in this section. 

Further nucleic acids of the invention include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90, 100, 150, 200, 300, 400 or 500 nucleotides, to the extent that the length of said span 
25 is consistent with the length of the SEQ ID, of SEQ ID Nos. 24, 25, 28 and 74. The invention 

also encompasses a purified, isolated, or recombinant polynucleotide comprising a nucleotide 
sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity to said span, as well as a 
purified, isolated, or recombinant nucleic acid that hybridizes with a said span under the 
stringent hybridization conditions as defined above. Optionally, said contiguous span comprises 
30 ar biallelic marker selected from the group consisting of A73 to A74, A77 and A124. 

Optionally allele 2 is present at the biallelic marker. It should be noted that nucleic acid 
fragments of any size and sequence may be comprised by the polynucleotides described in this 
section. 

Yet further nucleic acids of the invention include isolated, purified, or recombinant 
35 polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
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70, 80, 90, 100, 150, 200, 500 or 1000 nucleotides, to the extent that said span is consistent with 
the nucleotide position range, of SEQ ID No 1, wherein said contiguous span comprises at least 
1 , 2, 3, 5, or 10 of the following nucleotide positions of SEQ ID No 1: 201188 to 20 1234, 
216836 to 216915 and 246273 to 247781, or the complements thereof, as well as 
5 polynucleotides having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with said span, 

and polynucleotides capable of hybridizing with said span. In other embodiments, said 
contiguous span may specifically exclude one or more of the following nucleotide positions of 
SEQ ID No. 1: 201 188 to 201234, 216836 to 216915, 214676 to214793 and 215702 to 
215746. 

10 The present invention also comprises a purified or isolated nucleic acid encoding a 

g35030 protein having the amino acid sequence of any one of SEQ ID Nos 18 to 23 or a peptide 
fragment or variant thereof. 

It should be noted that nucleic acid fragments of any size and sequence may also be 
comprised by the polynucleotides described in this section. 

1 5 While this section is entitled "Genomic Sequences of g35030," it should be noted that 

nucleic acid fragments of any size and sequence may also be comprised by the polynucleotides 
described in this section, flanking the genomic sequences g35030 on either side or between two 
or more such genomic sequences. 

g35030 cDNA Sequences 

20 The expression of the g35Q30 gene has been shown to lead to the production of several 

mRNA species. Several cDNA sequences corresponding to these mRNA are set forth in SEQ 
IDNos2tol7. 

The invention encompasses a purified, isolated, or recombinant nucleic acid comprising 
a nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 1 7, 

25 complementary sequences thereto, splice variants thereof, as well as allelic variants, and 

fragments thereof. Moreover, preferred polynucleotides of the invention include purified, 
isolated, or recombinant g35030 cDNAs consisting of, consisting essentially of, or comprising a 
nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 1 7. Particularly 
preferred nucleic acids of the invention include isolated, purified, or recombinant 

30 polynucleotides comprising a contiguous span of at least 8, 12, 1 5, 1 8, 20, 25, 30, 35, 40, 50, 

60, 70, 75, 80, 100, 200, 500 or 1000 nucleotides, to the extent that the length of said 
contiguous span is consistent with the length of the SEQ ID, of a nucleotide sequence selected 
from the group consisting of SEQ ID Nos 2 to 1 7, or the complements thereof. 

It should be noted that nucleic acid fragments of any size and sequence may also be 
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comprised by the polynucleotides described in this section. 

The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 70, 80, 85, 90 or 95% nucleotide identity with a polynucleotide 
selected from the group consisting of SEQ ID Nos 2 to 17, advantageously 99% nucleotide 
identity, preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide identity 
with a polynucleotide selected from the group consisting of SEQ ID Nos 2 to 1 7, or a sequence 
complementary thereto or a biologically active fragment thereof. 

Another object of the invention relates to purified, isolated or recombinant nucleic acids 
comprising a polynucleotide that hybridizes, under the stringent hybridization conditions 
defined herein, with a polynucleotide selected from the group consisting of SEQ ID Nos 2 to 1 7, 
or a sequence complementary thereto or a variant thereof or a biologically active fragment 
thereof. 

The g35030 cDNA forms of SEQ ID Nos 2 to' 1 7 are further described in Table 2 
below. Shown on the Table 2 are the positions of the 5' UTR, the open reading frame (ORF), 
the 3' UTR and the poiyA signal on the respective SEQ ID No. Also shown are the g35030 
exons comprising the cDNA form of a particular SEQ ID No. 



Table 2 



cDNA 


SEQ ED 


Position range 


Position range 


Position range 


Position range 




No of 


of5 f UTR in 


of ORF in 


of 3' UTR in 


of polyA signal 




cDNA 


SfcQ ID 


SEQ ID 


SEQID 


in SEQ H> 


VV1WYZ 


2 




95 


96 


167 


168 


2226 


2200 


2205 


W1V3WYZ 


3 




95 


96 


167 


168 


2431 


2405 


2410 


V2WYZ 


4 




95 


96 


311 


312 


2215 


2189 


2194 


VWYZ 


5 




95 


96 


212 


213 


2132 


2106 


2111 


V2YZ 


6 




95 


96 


260 


261 


1981 


1955 


1960 


W1YZ 


7 




95 


96 


167 


168 


1992 


1966 


1971 


VYZ 


8 




95 


96 


182 


183 


1898 


1872 


1877 


W1Z 


9 




95 


96 


167 


168 


1783 


1757 


1762 


V2V1WYZ 


10 




95 


96 


311 


312 


2309 


2283 


2288 


V2V1YZ 


11 




95 


96 


311 


312 


2075 


2049 


2054 


VWZ 


12 




95 


96 


212 


213 


1923 


1897 


1902 


W1V4WYZ 


13 




95 


96 


167 


168 


2308 


2282 


2287 


VV1W2YZ 


14 




95 


96 


167 


168 


2706 


2680 


2685 


VW2YZ 


15 




95 


96 


212 


213 


2612 


2586 


2591 



WO 01/40493 



29 



PCT/IBOO/01507 



STUY 


16 


1 


370 


371 


436 


437 


434 






S2TUV 


17 


1 


696 


697 


762 


763 


760 







The present invention thus further encompasses variant g35030 polynucleotides having 
at least one nucleotide substitution, insertion or deletion as described further herein. In 
preferred embodiments described in Table 3, the invention comprises g3503O polynucleotides 

5 comprising a variation as described herein by biallelic markers 8-126-286 (also referred to as 8- 

130-143) and 8-155-258. The invention thus encompasses purified, isolated, or recombinant 
polynucleotides and polypeptides encoded thereby, wherein the polynucleotides comprise a 
contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 60, 70, 80, 100, 150, or 200 
nucleotides of SEQ ID No 2 to 17 or the complement thereof, wherein said contiguous span 

10 further comprises nucleotide position 132 of any of SEQ ID Nos 2 to 1 5 and 17, or nucleotide 

position 407 of SEQ ID No 16. Optionally, an A is present at said nucleotide position; 
optionally a G is present at said nucleotide position. In a further embodiment, said contiguous 
span comprises nucleotide position 328 or SEQ ID 3, or the complement thereof, and optionally 
a C is present at said position, or optionally a T is present at said position. 

15 Table 3 



SEQ ID 


cDNA form 


Polymorphism 


Alleles 


Position in 


Position in SEQ 


Position 


No 




reference 




SEQ ID 


ID No of cDNA 


in amino 










Nol 




acid seq 


2 


V1WYZ 


8-126-286/ 


A 


G 


216967 


132 


13 






8-130-143 












3 


W1V3WYZ 










132 


13 


4 


V2WYZ 










132 


13 


5 


VWYZ 










132 


13 


6 


V2YZ 










132 


13 


7 


W1YZ 










132 


13 


8 


VYZ 










132 


13 


9 


W1Z 










132 


13 


10 


V2V1WYZ 










132 


13 


11 


V2V1YZ 










132 


13 


12 


vwz 










132 


13 


13 


W1V4WYZ 










132 


13 


14 


W1W2YZ 










132 


13 


15 


VW2YZ 










132 


13 


16 


STUV 










•407 


13 


17 


S2TUV 










733 


13 


3 


W1V3WYZ 


8-155-258 


C 


T 


238789 


W1V3WYZ:328 





While this section is entitled "g35030 cDNA Sequences," it should be noted that nucleic 
acid fragments of any size and sequence may also be comprised by the polynucleotides 
described in this section, flanking the genomic sequences of g35030 on either side or between 
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two or more such genomic sequences. 

G35030 Coding Regions 

The g35030 open reading frame is contained in the corresponding mRNA of a cDNA 
sequence selected from the group consisting of SEQ ID Nos 2 to 1 7. The effective g35030 
5 coding sequence (CDS) may include several forms as indicated above, in some embodiments 

encompassing isolated, purified, and recombinant polynucleotides which encode a polypeptide 
comprising a contiguous span of at least 4 amino acids, preferably 6, more preferably at least 8 
or 10 amino acids, yet more preferably at least 12, 15,20,25, 30,40, 50, or 100 amino acids of 
SEQ ID Nos 1 8 to 23. The effective g35030 coding sequence (CDS) may comprise the region 

10 between the first nucleotide of the ATG codon and the end nucleotide of the stop codon of SEQ 

ID Nos 2 to 17 as indicated in Table 2 above. 

The above disclosed polynucleotide that contains the coding sequence of the g35030 
gene may be expressed in a desired host ceil or a desired host organism when this 
polynucleotide is placed under the control of suitable expression signals. The expression 

15 signals may be either the expression signals contained in the regulatory regions in the g35030 

gene of the invention or in contrast the signals may be exogenous regulatory nucleic sequences. 
Such a polynucleotide, when placed under the suitable expression signals, may also be inserted 
in a vector for its expression and/or amplification. 

Regulatory Sequences Of g35030 
20 As mentioned, the genomic sequence of the g35030 gene contains regulatory sequences 

both in the non-coding 5'-flanking region and in the non-coding 3'-flanking region that border 
the g3503O coding region containing the exons of the gene. 

Polynucleotides derived from the 5* and 3' regulatory regions are useful in order to 
detect the presence of at least a copy of a g3503O nucleotide sequence of SEQ ID No 1 or a 
25 fragment thereof in a test sample. 

The promoter activity of the 5 ' regulatory regions contained in g3 503 0 can be assessed 
as described below. 

In order to identify the relevant biologically active polynucleotide fragments or variants 
of a g3503O regulatory region, one of skill in the art will refer to Sam brook et al(1989) s which 
30 describes the use of a recombinant vector carrying a marker gene (i.e. beta galactosidase, 

chloramphenicol acetyl transferase, etc.) the expression of which will be detected when placed 
under the control of a biologically active polynucleotide fragment or variant of the g35030 
sequence of SEQ ID No 1 . Genomic sequences located upstream of the first exon of the g35030 
gene are cloned into a suitable promoter reporter vector, such as the pSEAP-Basic, pSEAP- 
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Enhancer, ppgal-Basic, ppgal-Enhancer, or pEGFP-1 Promoter Reporter vectors available from 
Clontech, or pGL2-basic or pGL3 -basic promoterless luciferase reporter gene vector from 
Promega. Briefly, each of these promoter reporter vectors include multiple cloning sites 
positioned upstream of a reporter gene encoding a readily assayable protein such as secreted 
5 alkaline phosphatase, luciferase, p galactosidase, or green fluorescent protein. The sequences 

upstream of the g3503O coding region are inserted into the cloning sites upstream of the reporter 
gene in both orientations and introduced into an appropriate host cell. The level of reporter 
protein is assayed and compared to the level obtained from a vector which lacks an insert in the 
cloning site. The presence of an elevated expression level in the vector containing the insert 

1 0 with respect to the control vector indicates the presence of a promoter in the insert. If 

necessary, the upstream sequences can be cloned into vectors which contain an enhancer for 
increasing transcription levels from weak promoter sequences. A significant level of expression 
above that observed with the vector lacking an insert indicates that a promoter sequence is 
present in the inserted upstream sequence. 

1 5 Promoter sequence within the upstream genomic DNA may be further defined by 

constructing nested 5* and/or 3 5 deletions in the upstream DNA using conventional techniques 
such as Exonuclease III or appropriate restriction endonuclease digestion. The resulting 
deletion fragments can be inserted into the promoter reporter vector to determine whether the 
deletion has reduced or obliterated promoter activity, such as described, for example, by Coles 

20 et al.(1998). In this way, the boundaries of the promoters may be defined. If desired, potential 

individual regulatory sites within the promoter may be identified using site directed mutagenesis 
or linker scanning to obliterate potential transcription factor binding sites within the promoter 
individually or in combination. The effects of these mutations on transcription levels may be 
determined by inserting the mutations into cloning sites in promoter reporter vectors. This type 

25 of assay is well-known to those skilled in the art and is described in WO 97/17359, US Patent 

No. 5,374,544; EP 582 796; US Patent No. 5,698,389; US 5,643,746; US Patent No. 5,502, 1 76; 
and US Patent 5,266,488. 

The strength and the specificity of the promoter of the g35030 gene can be assessed 
through the expression levels of a detectable polynucleotide operably linked to the g35O30 

30 promoter in different types of cells and tissues. The detectable polynucleotide may be either a 

polynucleotide that specifically hybridizes with a predefined oligonucleotide probe, or a 
polynucleotide encoding a detectable protein, including a g35O30 polypeptide or a fragment or a 
variant thereof. This type of assay is well-known to those skilled in the art and is described in 
US Patent No. 5,502,176; and US Patent No. 5,266,488. Some of the methods are discussed in 

35 more detail below. 
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Polynucleotides carrying the regulatory elements located at the 5' end and at the 3* end 
of the g35030 coding region may be advantageously used to control the transcriptional and 
translational activity of an heterologous polynucleotide of interest. 

Thus, the present invention also concerns a purified or isolated nucleic acid comprising 
5 a polynucleotide which is selected from the group consisting of the 5' and 3 9 regulatory regions 

of g35030, or a sequence complementary thereto or a biologically active fragment or variant 
thereof. In one aspect, "5' regulatory region" may comprise the nucleotide sequence located 
between positions 199122 to 201 122 of SEQ ID No 1. In one aspect, "3' regulatory region" 
may comprise the nucleotide sequence located between positions 247803 to 249803 of SEQ ID 
10 Nol. 

The invention also pertains to a purified or isolated nucleic acid comprising a 
polynucleotide having at least 95% nucleotide identity with a polynucleotide selected from the 
group consisting of the 5' and V regulatory regions, advantageously 99 % nucleotide identity, 
preferably 99.5% nucleotide identity and most preferably 99.8% nucleotide identity with a 

15 polynucleotide selected from the group consisting of the 5' and V regulatory regions, or a 

sequence complementary thereto or a variant thereof or a biologically active fragment thereof 

Another object of the invention consists of purified, isolated or recombinant nucleic 
acids comprising a polynucleotide that hybridizes, under the stringent hybridization conditions 
defined herein, with a polynucleotide selected from the group consisting of the nucleotide 

20 sequences of the 5*- and 3' regulatory regions of g35030, or a sequence complementary thereto 

or a variant thereof or a biologically active fragment thereof. 

Preferred fragments of the 5* regulatory region have a length of about 1500 or 1000 
nucleotides, preferably of about 500 nucleotides, more preferably about 400 nucleotides, even 
more preferably 300 nucleotides and most preferably about 200 nucleotides. 

25 Preferred fragments of the 3' regulatory region are at least 50, 100, 1 50, 200, 300 or 

400 bases in length. 

"Biologically active" g3503O regulatory polynucleotide derivatives of SEQ ID No 1 are 
polynucleotides comprising or alternatively consisting in a fragment of said polynucleotide 
which is functional as a regulatory region for expressing a recombinant polypeptide or a 
30 recombinant polynucleotide in a recombinant cell host. It could act either as an enhancer or as a 

repressor. 

For the purpose of the invention, a nucleic acid or polynucleotide is "functional" as a 
regulatory region for expressing a recombinant polypeptide or a recombinant polynucleotide if 
said regulatory polynucleotide contains nucleotide sequences which contain transcriptional and 
35 translational regulatory information, and such sequences are "operably linked" to nucleotide 
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sequences which encode the desired polypeptide or the desired polynucleotide. 

The regulatory polynucleotides of the invention may be prepared from the nucleotide 
sequence of SEQ ID No 1 by cleavage using suitable restriction enzymes, as described for 
example in Sambrook etal.(1989). The regulatory polynucleotides may also be prepared by 
5 digestion of SEQ ID No 1 by an exonuclease enzyme, such as BaI31 (Wabiko et al., 1986). 

These regulatory polynucleotides can also be prepared by nucleic acid chemical synthesis, as 
described elsewhere in the specification. 

The g3503O regulatory polynucleotides according to the invention may be part of a 
recombinant expression vector that may be used to express a coding sequence in a desired host 
1 0 cell or host organism. The recombinant expression vectors according to the invention are 

described elsewhere in the specification. 

A preferred 5'-regulatory polynucleotide of the invention includes the 5 '-untranslated 
region (5'-UTR) of the g35030 cDNA, or a biologically active fragment or variant thereof. 

A preferred 3 '-regulatory polynucleotide of the invention includes the 3'-untranslated 
1 5 region (3*-UTR) of the g35030 cDNA, or a biologically active fragment or variant thereof. 

A further object of the invention consists of a purified or isolated nucleic acid 
comprising: 

a) a nucleic acid comprising a regulatory nucleotide sequence selected from the group 
consisting of: 

20 (i) a nucleotide sequence comprising a polynucleotide of the g35030 5* regulatory region or a 

complementary sequence thereto; 

(ii) a nucleotide sequence comprising a polynucleotide having at least 95% of nucleotide 
identity with the nucleotide sequence of the g35030 5' regulatory region or a complementary 
sequence thereto; 

25 (iii) a nucleotide sequence comprising a polynucleotide that hybridizes under stringent 

hybridization conditions with the nucleotide sequence of the g35030 5* regulatory region or a 
complementary sequence thereto; and 

(iv) a biologically active fragment or variant of the polynucleotides in (i), (ii) and (iii); 

b) a polynucleotide encoding a desired polypeptide or a nucleic acid of interest, 
30 operably linked to the nucleic acid defined in (a) above; and 

c) optionally, a nucleic acid comprising a 3'- regulatory polynucleotide, preferably a 3'- 
regulatory polynucleotide of the g35030 gene. 

In a specific embodiment of the nucleic acid defined above, said nucleic acid includes 
the 5 '-untranslated region (5'-UTR) of the g3503O cDNA, or a biologically active fragment or 
35 variant thereof . 
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In a second specific embodiment of the nucleic acid defined above, said nucleic acid 
includes the 3 '-untranslated region (3'~UTR) of the g35030 cDNA, or a biologically active 
fragment or variant thereof. 

The regulatory polynucleotide of the 5' regulatory region, or its biologically active 
5 fragments or variants, is operably linked at the 5 s -end of the polynucleotide encoding the 

desired polypeptide or polynucleotide. 

The regulatory polynucleotide of the 3' regulatory region, or its biologically active 
fragments or variants, is advantageously operably linked at the 3'-end of the polynucleotide 
encoding the desired polypeptide or polynucleotide. 
1 0 The desired polypeptide encoded by the above-described nucleic acid may be of various 

nature or origin, encompassing proteins of prokaryotic oreukaryotic origin. Among the 
polypeptides expressed under the control of a g35030 regulatory region include bacterial, fungal 
or viral antigens. Also encompassed are eukaryotic proteins such as intracellular proteins, like 
"house keeping" proteins, membrane-bound proteins, like receptors, and secreted proteins like 
1 5 endogenous mediators such as cytokines. The desired polypeptide may be the g35030 protein, 

especially the protein of the amino acid sequences of SEQ ID Nos 18 to 23, or a fragment or a 
variant thereof. 

The desired nucleic acids encoded by the above-described polynucleotide, usually an 
RNA molecule, may be complementary to a desired coding polynucleotide, for example to the 
20 g35030 coding sequence, and thus useful as an antisense polynucleotide. 

Such a polynucleotide may be included in a recombinant expression vector in order to 
express the desired polypeptide or the desired nucleic acid in host cell or in a host organism. 
Suitable recombinant vectors that contain a polynucleotide such as described herein are 
disclosed elsewhere in the specification. 

25 

Polynucleotide Constructs 

The terms " polynucleotide construct " and " recombinant polynucleotide" are used 
interchangeably herein to refer to linear or circular, purified or isolated polynucleotides that 
have been artificially designed and which comprise at least two nucleotide sequences that are 
30 not found as contiguous nucleotide sequences in their initial natural environment. It should be 

noted that the present invention embodies recombinant vectors comprising any one of the 
polynucleotides described in the present invention. 

DNA Constructs that Enables Directing Temporal and Spatial Expression of 
35 g35030 Nucleic Acid Sequences in Recombinant Ceil Hosts and in Transgenic Animals 
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In order to study the physiological and phenotypic consequences of a lack of synthesis 
of a protein encoded by a nucleotide sequence comprising a g35030 polynucleotide, both at the 
cell level and at the multi cellular organism level, the invention also encompasses DNA 
constructs and recombinant vectors enabling a conditional expression of a specific allele of a 
5 nucleotide sequence comprising a g35030 polynucleotide and also of a copy of a sequence 

comprising a nucleotide sequence of a g35030 polynucleotide, or a fragment thereof, harboring 
substitutions, deletions, or additions of one or more bases. These base substitutions, deletions 
or additions may be located either in an exon, an intron or a regulatory sequence, in particular a 
5' regulatory sequence of a g35030 polynucleotide. In a preferred embodiment, the nucleotide 
10 sequence comprising a g35030 polynucleotide further comprises a biallelic marker of the 

present invention. 

A first preferred DNA construct is based on the tetracycline resistance operon tet from 
E. coli transposon Tnl 10 for controlling the expression of a g35030 polynucleotide, such as 
described by Gossen et al. (1992, 1995) and Furth et al.(1994). Such a DNA construct contains 

15 seven tet operator sequences from TnlO (tetop) that are fused to either a minimal promoter or a 

5-reguIatory sequence of the g35030 polynucleotide, said minimal promoter or said g35030 
polynucleotide regulatory sequence being operably linked to a polynucleotide of interest that 
codes either for a sense or an antisense oligonucleotide or for a polypeptide, including a g35030 
polynucleotide-encoded polypeptide or a peptide fragment thereof This DNA construct is 

20 functional as a conditional expression system for the nucleotide sequence of interest when the 

same cell also comprises a nucleotide sequence coding for either the wild type (tTA) or the 
mutant (rTA) repressor fused to the activating domain of viral protein VP 16 of herpes simplex 
virus, placed under the control of a promoter, such as the HCMVIE1 enhancer/promoter or the 
MMTV-LTR. Indeed, a preferred DNA construct of the invention comprises both the 

25 polynucleotide containing the tet operator sequences and the polynucleotide containing a 

sequence coding for the tTA or the rTA repressor. 

In a specific embodiment, the conditional expression DNA construct contains the 
sequence encoding the mutant tetracycline repressor rTA, the expression of the polynucleotide 
of interest is silent in the absence pf tetracycline and induced in its presence. 

30 DNA Constructs Allowing Homologous Recombination: Replacement Vectors 

A second preferred DNA construct will comprise, from 5 f -end to 3'-end: (a) a first 
nucleotide sequence comprising a g35030 polynucleotide; (b) a nucleotide sequence 
comprising a positive selection marker, such as the marker for neomycine resistance (neo); and 
(c) a second nucleotide sequence comprising a respective g35030 polynucleotide, and is located 

35 on the genome downstream of the first g3503O polynucleotide sequence (a). 
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In a preferred embodiment, this DNA construct also comprises a negative selection 
marker located upstream the nucleotide sequence (a) or downstream the nucleotide sequence 
5 (c). Preferably, the negative selection marker comprises the thymidine kinase (tk) gene 

(Thomas et a!., 1986), the hygromycine beta gene (Te RJele et al., 1990), the hprt gene ( Van 
der Lugt et al., 1991; Reid et al., 1990) or the Diphteria toxin A fragment {Dt-A) gene (Nada et 
al., 1993; Yagi et al.1990). Preferably, the positive selection marker is located within and exon 
of a g35030 polynucleotide so as to interrupt the sequence, encoding the g35030 protein. These 
10 replacement vectors are described, for example, by Thomas et al.(1986; 1987), Mansour et 

aI.(198S) and Roller etal.(1992). 

The first and second nucleotide sequences (a) and (c) may be indifferently located 
within a g35030 polynucleotide regulatory sequence, an intronic sequence, an exon sequence or 
a sequence containing both regulatory and/or intronic and/or exon sequences. The size of the 
15 nucleotide sequence of (a) and (c) ranges from 1 to 50 kb, preferably from 1 to 10 kb, more 

preferably from 2 to 6 kb and most preferably from 2 to 4 kb. 

DNA Constructs Allowing Homologous Recombination: Cre-LoxP System. 
These new DNA constructs make use of the site specific recombination system of the 
PI phage. The PI phage possesses a recombinase called Cre Which interacts specifically with a ' 
20 34 base pairs loxP site. The loxP site is composed of two palindromic sequences of 13 bp 

separated by a 8 bp conserved sequence (Ho.ess et al., 1986). The recombination by the Cre 
enzyme between two ioxP sites having an identical orientation leads to the deletion of the DNA 
fragment. 

The Cre-ZoxP system used in combination with a homologous recombination technique 
25 has been first described by Gu et al.(l 993, 1994). Briefly, a nucleotide sequence of interest to 

be inserted in a targeted location of the genome harbors at least two loxP sites in the same 
orientation and located at the respective ends of a nucleotide sequence to be excised from the 
recombinant genome. The excision event requires the presence of the recombinase (Cre) 
enzyme within the nucleus of the recombinant cell host. The recombinase enzyme may be 
30 brought at the desired time either by (a) incubating the recombinant cell hosts in a culture 

medium containing this enzyme, by injecting the Cre enzyme directly into the desired cell, such 
as described by Araki et al.(1995), or by Hpofection of the enzyme into the cells, such as 
described by Baubonis et al.(1993); (b) transfecting the cell host with a vector comprising the 
Cre coding sequence operably linked to a promoter functional in the recombinant cell host, 
35 which promoter being optionally inducible, said vector being introduced in the recombinant cell 
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host, such as described by Gu et al.(1993) and Sauer et al.(1988); ( c ) introducing in the genome 
of the cell host a polynucleotide comprising the Cre coding sequence operably linked to a 
promoter functional in the recombinant cell host, which promoter is optionally inducible, and 
said polynucleotide being inserted in the genome of the cell host either by a random insertion 
5 event or an homologous recombination event, such as described by Gu et al.(1994). 

In a specific embodiment, the vector containing the sequence to be inserted in a g35030 
gene sequence by homologous recombination is constructed in such a way that selectable 
markers are flanked by lox? sites of the same orientation, it is possible, by treatment by the Cre 
enzyme, to eliminate the selectable markers while leaving the g35030 polynucleotide sequences 

10 of interest that have been inserted by an homologous recombination event. Again, two 

selectable markers are needed: a positive selection marker to select for the recombination event 
and a negative selection marker to select for the homologous recombination event. Vectors and 
methods using the Cre-lox? system are described by Zou et al.(1994). 

Thus, in one aspect, a further preferred DNA construct of the invention comprises, from 

1 5 5*-end to 3-end: (a) a first nucleotide sequence that is comprised by a g35030 polynucleotide; 

(b) a nucleotide sequence comprising a polynucleotide encoding a positive selection marker, 
said nucleotide sequence comprising additionally two sequences defining a site recognized by a 
recombinase, such as a lox? site, the two sites being placed in the same orientation; and (c) a 
second nucleotide sequence comprising a g35030 polynucleotide, and is located on the genome 

20 downstream of the first g3503O polynucleotide sequence (a). 

The sequences defining a site recognized by a recombinase, such as a lox? site, are 
preferably located within the nucleotide sequence (b) at suitable locations bordering the 
nucleotide sequence for which the conditional excision is sought. In one specific embodiment, 
two lox? sites are located at each side of the positive selection marker sequence, in order to 

25 allow its excision at a desired time after the occurrence of the homologous recombination event. 

In a preferred embodiment of a method using the third DNA construct described above, 
the excision of the polynucleotide fragment bordered by the two sites recognized by a 
recombinase, preferably two loxP sites, is performed at a desired time, due to the presence 
within the genome of the recombinant host cell of a sequence encoding the Cre enzyme 

30 operably linked to a promoter sequence, preferably an inducible promoter, more preferably a 

tissue-specific promoter sequence and most preferably a promoter sequence which is both 
inducible and tissue-specific, such as described by Gu et al.(1994). 

The presence of the Cre enzyme within the genome of the recombinant cell host may 
result from the breeding of two transgenic animals, the first transgenic animal bearing the 

35 g3 5030 polynucleotide -derived sequence of interest containing the lox? sites as described 
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above and the second transgenic animal bearing the Cre coding sequence operably linked to a 
suitable promoter sequence, such as described by Gu et al.(l 994). 

Spatio-temporal control of the Cre enzyme expression may also be achieved with an 
adenovirus based vector that contains the Cre gene thus allowing infection of cells, or in vivo 
5 infection of organs, for delivery of the Cre enzyme, such as described by Anton and Graham 

(1995) and Kanegae et al.(l 995). 

The DNA constructs described above may be used to introduce a desired nucleotide 
sequence of the invention, preferably a g35030 polynucleotide, and most preferably an altered 
copy of a g35030 polynucleotide sequence, within a predetermined location of the targeted 
10 genome, leading either to the generation of an altered copy of a targeted gene (knock-out 

homologous recombination) or to the replacement of a copy of the targeted gene by another 
copy sufficiently homologous to allow an homologous recombination event to occur (knock-in 
homologous recombination). In a specific embodiment, the DNA constructs described above 
may be used to introduce a g35030 polynucleotide. 
15 Nuclear Antisense DNA Constructs 

Other compositions containing a vector of the invention comprise an oligonucleotide 
fragment of the g35030 polynucleotide sequences of SEQ ID No.l respectively, as an antisense 
tool that inhibits the expression of the corresponding gene. Preferred methods using antisense 
polynucleotide according to the present invention are the procedures described by Sczakiel et 
20 al.(1995) or those described in PCT Application No WO 95/24223. 

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) 
that are complementary to the 5'end of a g35030 polynucleotide mRNA. In one embodiment, a 
combination of different antisense polynucleotides complementary to different parts of the 
desired targeted gene are used. 
25 Preferably, the antisense polynucleotides of the invention have a 3' polyadenylation 

signal that has been replaced with a self-cleaving ribozyme sequence, such that RNA 
polymerase II transcripts are produced without poly(A) at their 3' ends, these antisense 
polynucleotides being incapable of export from the nucleus, such as described by Liu et 
al.(1994). In a preferred embodiment, these g35030 antisense polynucleotides also comprise, 
30 within the ribozyme cassette, a histone stem-loop structure to stabilize cleaved transcripts 

against 3-5' exonucleolytic degradation, such as the structure described by Eckner et al.(1991); 
Constructs for Activation of g35030 gene 

The present invention also encompasses primary, secondary, and immortalized 
homoiogously recombinant host cells of vertebrate origin, preferably mammalian origin and 
35 particularly human origin, that have been engineered to: a) insert exogenous (heterologous) 
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polynucleotides into the endogenous chromosomal DNA of a targeted gene, b) delete 
endogenous chromosomal DNA, and/or c) replace endogenous chromosomal DNA with 
exogenous polynucleotides. Insertions, deletions, and/or replacements of polynucleotide 
sequences may'be to the coding sequences of the targeted gene and/or to regulatory regions, 
5 such as promoter and enhancer sequences, operably associated with the targeted gene. 

The present invention further relates to a method of making a homologously 
recombinant host cell in vitro or in vivo, wherein the expression of a targeted gene not normally 
expressed in the cell is altered. Preferably the alteration causes expression of the targeted gene 
under normal growth conditions or under conditions suitable for producing the polypeptide 

10 encoded by the targeted gene. The method comprises the steps of: (a) transfecting the cell in 

vitro or in vivo with a polynucleotide construct, the a polynucleotide construct comprising; (i) a 
targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired 
splice donor site, if necessary, thereby producing a transfected cell; and (b) maintaining the 
transfected cell in vitro or in vivo under conditions appropriate for homologous recombination. 

15 The present invention further relates to a method of altering the expression of a targeted 

gene in a cell in vitro or in vivo wherein the gene is not normally expressed in the cell, 
comprising the steps of: (a) transfecting the cell in vitro or in vivo with a a polynucleotide 
construct, the a polynucleotide construct comprising: (i) a targeting sequence; (ii) a regulatory 
sequence and/or a coding sequence; and (iii) an unpaired splice donor site, if necessary, thereby 

20 producing a transfected cell; and (b) maintaining the transfected cell in vitro or in vivo under 

conditions appropriate for homologous recombination, thereby producing a homologously 
recombinant cell; and (c) maintaining the homologously recombinant cell in vitro or in vivo 
under conditions appropriate for expression of the gene. 

The present invention further relates to a method of making a polypeptide of the present 

25 invention by altering the expression of a targeted endogenous gene in a cell in vitro or in vivo 

wherein the gene is not normally expressed in the cell, comprising the steps of: a) transfecting 
the cell in vitro with a a polynucleotide construct, the a polynucleotide construct comprising: (i) 
a targeting sequence; (ii) a regulatory sequence and/or a coding sequence; and (iii) an unpaired 
splice donor site, if necessary, thereby producing a transfected cell; (b) maintaining the 

30 transfected cell in vitro or in vivo under conditions appropriate for homologous recombination, 

thereby producing a homologously recombinant cell; and c) maintaining the homologously 
recombinant cell in vitro or in vivo under conditions appropriate for expression of the gene 
thereby making the polypeptide. 

The present invention, further relates to a a polynucleotide construct which alters the 

35 expression of a targeted gene in a cell type in which the gene is not normally expressed. This 
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occurs when the a polynucleotide construct is inserted into the chromosomal DNA of the target 
cell, wherein the a polynucleotide construct comprises: a) a targeting sequence; b) a regulatory 
sequence and/or coding sequence; and c) an unpaired splice-donor site, if necessary. Farther 
included are a polynucleotide constructs, as described above, wherein the contruct further 
5 comprises A a polynucleotide which encodes a polypeptide and is in-frame with the targeted 

endogenous gene after homologous recombination with chromosomal DNA. 

The compositions may be produced, and methods performed, by techniques known in 
the art, such as those described in U.S. Patent Nos: 6,054,288; 6,048,729; 6,048,724; 6,048,524; 
5,994,127; 5,968,502; 5,965,125; 5,869,239; 5,817,789; 5,783,385; 5,733,761; 5,641,670; 
10 5,580,734 ; International Publication Nos:W096/294 1 1, WO 94/12650; and scientific articles 

including 1994; Koller et.al., Proc. Nati. Acad. Sci. USA 86:8932-8935 (1989) (the disclosures 
of each of which are incorporated by reference in their entireties). 

Oligonucleotide Probes And Primers 

15 The polynucleotides of the invention are useful in order to detect the presence of at least 

a copy of a g35030 nucleotide sequence of SEQ ID Nos 1 to 17 or of the respective g35030 
polynucleotide or gene, or a fragment, complement, or variant thereof in a test sample. 

Particularly preferred probes and primers of the invention include isolated, purified, or 
recombinant polynucleotides comprising a contiguous span of at least 12, 1 5, 1 8, 20, 25, 30, 35, 

20 40, 50, 60, 70, 80, 90, 1 00, 150, 200, 500, 1 000 or 2000 nucleotides, to the extent that said span 

is consistent with the length of the nucleotide position range, of SEQ ID No 1, wherein said 
contiguous span comprises at least 1, 2, 3, 4, 5, 7 or 10 of the following nucleotide positions of 
SEQ ID No 1: 201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 
216836 to 216994, 2 1683 6 to 217077, 217671 to 2 17764, 227655 to 227736,238715 to 

25 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 

to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803, or a 
complementary sequence thereto or a fragment thereof. 

Probes and primers of the invention also include isolated, purified, or recombinant 
polynucleotides having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with a contiguous 

30 span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 

2000 nucleotides of nucleotide positions 201 123 to 201234, 201 123 to 201560, 214676 to 
214793, 21 5702 to 215746, 216836 to 216994, 216836 to 217077, 217671 to 217764, 227655 
to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 
244353 to 244561, 246273 to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 

35 and 199 122 to 249803 of SEQ ID No. 1 . Preferred probes and primers of the invention also 
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include isolated, purified, or recombinant polynucleotides comprising a g35030 nucleotide 
sequence having at least 70, 75, 80, 85, 90, or 95% nucleotide identity with at least one 
sequence selected from the group consisting of the following nucleotide positions of SEQ ED 
No. 1 : 201 123 to 20 1234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
5 216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 

to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 
201 323 to 247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803; or a 
complementary sequence thereto or a fragment thereof. 

The invention also relates to nucleic acid probes characterized in that they hybridize 

1 0 specifically, under the stringent hybridization conditions defined above, with a contiguous span 

of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 1000 or 2000 
nucleotides of nucleotide positions 201 123 to 201234, 201123 to 201560, 214676 to 214793, 
215702 to 215746, 216836 to 216994, 216836 to 217077, 217671 to 217764, 227655 to 
227736, 238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 

15 to 244561, 246273 to 247802, 201123 to247802, 199122 to 201122,247803 to 249803 and 

199122 to 249803 of SEQ ID No, 1, or a variant thereof or a sequence complementary thereto. 

Probes and primers of the invention further include isolated, purified, or recombinant 
polynucleotides comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 
70, 80, 90, 100, 150, 200, 500, 1000 or 2000 nucleotides, to the extent that said span is 

20 consistent with the length of the nucleotide position range, of SEQ ID No 2 to 17 or 79 to 132. 

Also included are isolated, purified, or recombinant polynucleotides having at least 70, 75, 80, 
85, 90, or 95% nucleotide identity with a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 
40, 50, 60, 70, 80, 90, 1 00, 1 50, 200, 500, 1 000 or 2000 nucleotides of SEQ ID No. 2 to 1 7 or 
79 to 132, or a complementary sequence thereto, as well as isolated, purified, or recombinant 

25 polynucleotides comprising a g35O30 nucleotide sequence having at least 70, 75, 80, 85, 90, or 

95% nucleotide identity with at least one sequence selected from the group consisting of the 
following nucleotide positions of SEQ ID No. 2 to 17 or 79 to 132; or a complementary 
sequence thereto or a fragment thereof. The invention also relates to nucleic acid probes 
characterized in that they hybridize specifically, under the stringent hybridization conditions 

30 defined above, with a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 

90, 100, 150, 200, 500, 1000 or 2000 nucleotides of nucleotide SEQ ID No. 2 to 17 or 79 to 
132, or a variant thereof or a sequence complementary thereto. 

The formation of stable hybrids depends on the melting temperature (Tm) of the DNA. 
The Tm depends on the length of the primer or probe, the ionic strength of the solution and the 

35 G+C content. The higher the G+C content of the primer or probe, the higher is the melting 



WO 01/40493 PCT/EBOO/01507 

42 

temperature because G:C pairs are held by three H bonds whereas A:T pairs have only two. 
The GC content in the probes of the invention usually ranges between 10 and 75 %, preferably 
between 35 and 60 %, and more preferably between 40 and 55 %. 

A probe or a primer according to the invention may be between 8 and 2000 nucleotides 
5 in length, or is specified to be at least 12, 15, 1 8, 20, 25, 35, 40, 50, 60, 70, 80, 100, 250, 500 , 

1 000 nucleotides in length. More particularly, the length of these probes can range from 8, 30, 
15, 20, or 30 to 100 nucleotides, preferably from 10 to 50, more preferably from 15 to 30 
nucleotides. Shorter probes tend to lack specificity for a target nucleic acid sequence and 
generally require cooler temperatures to form sufficiently stable hybrid complexes with the 

10 template. Longer probes are expensive to produce and can sometimes self-hybridize to form 

hairpin structures. The appropriate length for primers and probes under a particular set of assay 
conditions may be empirically determined by one of skill in the art. 

The primers and probes can be prepared by any suitable method, including, for 
example, cloning and restriction of appropriate sequences and direct chemical synthesis by a 

15 method such as the phosphodi ester method of Narang et al.(1979), the phosphodiester method 

of Brown et al.(1979), the diethylphosphoramidite method of Beaucage et al.(1981) and the 
solid support method described in EP 0 707 592. 

Detection probes are generally nucleic acid sequences or uncharged nucleic acid 
analogs such as, for example peptide nucleic acids which are disclosed in International Patent 

20 Application WO 92/20702, morpholino analogs which are described in U.S. Patents Numbered 

5,185,444; 5,034,506 and 5,142,047, The probe may have to be rendered "non-extendable" in 
that additional dNTPs cannot be added to the probe. In and of themselves analogs usually are 
non-extendable and nucleic acid probes can be rendered non-extendable by modifying the 3' end 
of the probe such that the hydroxyl group is no longer capable of participating in elongation. 

25 For example, the 3 f end of the probe can be functionalized with the capture or detection label to 

thereby consume or otherwise block the hydroxyl group. Alternatively, the 3' hydroxyl group 
simply can be cleaved, replaced or modified; U.S. Patent Application Serial No. 07/049,061 
filed April 19, 1993, describes modifications which can be used to render a probe non- 
extendable, 

30 Any of the polynucleotides of the present invention can be labeled, if desired, by 

incorporating a label detectable by spectroscopic, photochemical, biochemical, 
immunochemical, or chemical means. For example, useful labels include radioactive 
32 35 3 125 

substances ( P, S, H, I), fluorescent dyes (5-bromodesoxyuridin, fluorescein, 
acetylaminofluorene, digoxigenin) or biotin. Preferably, polynucleotides are labeled at their 3* 
35 and 5' ends. Examples of non-radioactive labeling of nucleic acid fragments are described in 
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the French patent No. FR-78 10975 or by Urdeaet al (1988) or Sanchez-Pescador et al (1988). 
In addition, the probes according to the present invention may have structural characteristics 
such that they allow the signal amplification, such structural characteristics being, for example, 
branched DNA'probes as those described by Urdea et al. in 1991 or in the European patent No. 
5 EP 0 225 807 (Chiron). 

A label can also be used to capture the primer, so as to facilitate the immobilization of 
either the primer or. a primer extension product, such as amplified DNA, on a solid support. A 
capture label is attached to the primers or probes and can be a specific binding member which 
forms a binding pair with the solid's phase reagent's specific binding member (e.g. biotin and 

10 streptavidin). Therefore depending upon the type of label carried by a polynucleotide or a 

probe, it may be employed to capture or to detect the target DNA. Further, it will be understood 
that the polynucleotides, primers or probes provided herein, may, themselves, serve as the 
capture label. For example, in the case where a solid phase reagent's binding member is a 
nucleic acid sequence, it may be selected such that it binds a complementary portion of a primer 

1 5 or probe to thereby immobilize the primer or probe to the solid phase. In cases where a 

polynucleotide probe itself serves as the binding member, those skilled in the art will recognize 
that the probe will contain a sequence or "tail" that is not complementary to the target. In the 
case where a polynucleotide primer itself serves as the capture label, at least a portion of the 
primer will be free to hybridize with a nucleic acid on a solid phase. DNA Labeling techniques 

20 are well known to the skilled technician. 

The probes of the present invention are useful for a number of purposes. They can be 
notably used in Southern hybridization to genomic DNA. The probes can also be used to detect 
PCR amplification products. They may also be used to detect mismatches in a sequence 
comprising a polynucleotide of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32, or a g35030 

25 polynucleotide or gene or mRNA using other techniques. 

Any of the polynucleotides, primers and probes of the present invention can be 
conveniently immobilized on a solid support. Solid supports are known to those skilled in the 
art and include the walls of wells of a reaction tray, test tubes, polystyrene beads, magnetic 
beads, nitrocellulose strips, membranes, microparticles such as latex particles, sheep (or other 

30 animal) red blood cells, duracytes and others. The solid support is not critical and can be 

selected by one skilled in the art. Thus, latex particles, microparticles, magnetic or non- 
magnetic beads, membranes, plastic tubes, walls of microtiter wells, glass or silicon chips, 
sheep (or other suitable animal's) red blood cells and duracytes are all suitable examples. 
Suitable methods for immobilizing nucleic acids on solid phases include ionic, hydrophobic, 

35 covalent interactions and the like. A solid support, as used herein, refers to any material which 
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is insoluble, or can be made insoluble by a subsequent reaction. The solid support can be 
chosen for its intrinsic ability to attract and immobilize the capture reagent. Alternatively, the 
solid phase can retain an additional receptor which has the ability to attract and immobilize the 
capture reagent The additional receptor can include a charged substance that is oppositely 
5 charged with respect to the capture reagent itself or to a charged substance conjugated to the 

capture reagent. As yet another alternative, the receptor molecule can be any specific binding 
member which is immobilized upon (attached to) the solid support and which has the ability to 
immobilize the capture reagent through a specific binding reaction. The receptor molecule 
enables the indirect binding of the capture reagent to a solid support material before the 

10 performance of the assay or during the performance of the assay. The solid phase thus can be a 

plastic, derivatized plastic, magnetic or non-magnetic metal, glass or silicon surface of a test 
tube, microtiter well, sheet, bead, microparticle, chip, sheep (or other suitable animal's) red 
blood cells, duracytes and other configurations known to those of ordinary skill in the art. The 
polynucleotides of the invention can be attached to or immobilized on a solid support 

15 individually or in groups of at least 2, 5, 8, 10, 12, 15, 20, or 25 distinct polynucleotides of the 

invention to a single solid support. In addition, polynucleotides other than those of the 
invention may be attached to the same solid support as one or more polynucleotides of the 
invention. 

Consequently, the invention also comprises a method for detecting the presence of a 
20 nucleic acid comprising a nucleotide sequence selected from a group consisting of SEQ ID Nos. 

1 to 17, 24, 25, 28, 74 and 79 to 132, a fragment or a variant thereof or a complementary 
sequence thereto in a sample, said method comprising the following steps of: 

a) bringing into contact a nucleic acid probe or a plurality of nucleic acid probes which 
can hybridize with a nucleotide sequence included in a nucleic acid selected form the group 

25 consisting of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132, a 

fragment or a variant thereof or a complementary sequence thereto and the sample to be 
assayed; and 

b) detecting the hybrid complex formed between the probe and a nucleic acid in the 

sample. 

30 The invention further concerns a kit for detecting the presence of a nucleic acid 

comprising a nucleotide sequence selected from a group consisting of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 132, a fragment or a variant thereof or a complementary sequence thereto 
in a sample, said kit comprising: 

a) a nucleic acid probe or a plurality of nucleic acid probes which can hybridize with a 

35 nucleotide sequence included in a nucleic acid selected form the group consisting of the 
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nucleotide sequences of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132, a fragment or a 
variant thereof or a complementary sequence thereto; and 

b) optionally, the reagents necessary for performing the hybridization reaction. 

In a first preferred embodiment of this detection method and kit, said nucleic acid probe 
5 or the plurality of nucleic acid probes are labeled with a detectable molecule. In a second 

preferred embodiment of said method and kit, said nucleic acid probe or the plurality of nucleic 
acid probes has been immobilized on a substrate. In a third preferred embodiment, the nucleic 
acid probe or the plurality of nucleic acid probes comprise either a sequence which is selected 
from the group consisting of the nucleotide sequences of PI to P127 and the complementary 
10 sequence thereto, Bl to Bl 14, CI to CI 14, Dl to D127, El to E127, or a nucleotide sequence 

comprising a biallelic marker selected from the group consisting of A I to A127, or the 
complements thereto. 

Oligonucleotide Arrays 

A substrate comprising a plurality of oligonucleotide primers or probes of the invention 
1 5 may be used either for detecting or amplifying targeted sequences in a nucleotide sequence of 

SEQ ID No. 1, more particularly in a g35030 polynucleotide, or in genes comprising a g35030 
polynucleotide and may also be used for detecting mutations in the coding or in the non-coding 
sequences of a g35030 nucleic acid sequence, or genes comprising a g35O30 nucleic acid 
sequence. 

20 Any polynucleotide provided herein may be attached in overlapping areas or at random 

locations on the solid support. Alternatively the polynucleotides of the invention may be 
attached in an ordered array wherein each polynucleotide is attached to a distinct region of the 
solid support which does not overlap with the attachment site of any other polynucleotide. 
Preferably, such an ordered array of polynucleotides is designed to be "addressable" where the 

25 distinct locations are recorded and can be accessed as part of an assay procedure. Addressable 

polynucleotide arrays typically comprise a plurality of different oligonucleotide probes that are 
coupled to a surface of a substrate in different known locations. The knowledge of the precise 
location of each polynucleotides location makes these "addressable" arrays particularly useful 
in hybridization assays. Any addressable array technology known in the art can be employed 

30 with the polynucleotides of the invention. One particular embodiment of these polynucleotide 

arrays is known as Genechips™, and has been generally described in US Patent 5,143,854; 
PCT publications WO 90/15070 and 92/10092. These anays may generally be produced using 
mechanical synthesis methods or light directed synthesis methods which incorporate a 
combination of photolithographic methods and solid phase oligonucleotide synthesis (Fodor et 

35 al„ 1 991 , incorporated herein by reference). The immobilization of arrays of oligonucleotides 
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on solid supports has been rendered possible by the development of a technology generally 
identified as "Very Large Scale Immobilized Polymer Synthesis" (VLSIPS™) in which, 
typically, probes are immobilized in a high density array on a solid surface of a chip. Examples 
of VLSIPS™ technologies are provided in US Patents 5,143,854; and 5,412,087 and in PCT 
5 Publications WO 90/15070, WO 92/10092 and WO 95/1 1995, which describe methods for 

forming oligonucleotide arrays through techniques such as light-directed synthesis techniques. 
In designing strategies aimed at providing arrays of nucleotides immobilized on solid supports, 
further presentation strategies were developed to order and display the oligonucleotide arrays on 
the chips in an attempt to maximize hybridization patterns and sequence information. Examples 

1 0 of such presentation strategies are disclosed in PCT Publications WO 94/1 2305, WO 94/1 1530, 

WO 97/29212 and WO 97/31256. 

In another embodiment of the oligonucleotide arrays of the invention, an 
oligonucleotide probe matrix may advantageously be used to detect mutations occurring in a 
g35030 polynucleotide, including in genes comprising a g35030 polynucleotide and preferably 

15 in a g35030 polynucleotide regulatory region. For this particular purpose, probes are 

specifically designed to have a nucleotide sequence allowing their hybridization to the genes 
that carry known mutations (either by deletion, insertion or substitution of one or several 
nucleotides). By known mutations in a §35030 polynucleotide, it is meant, mutations in a 
g35030 polynucleotide that have been identified according; the technique used by Huang et 

20 al.(1996) or Samson et al.(1996), for example, may be used to identify such mutations. 

Another technique that is used to detect mutations in a g35030 polynucleotide is the use 
of a high-density DNA array. Each oligonucleotide probe constituting a unit element of the 
high density DNA array is designed to match a specific subsequence of a g3 5030 
polynucleotide. Thus, an array consisting of oligonucleotides complementary to subsequences 

25 of the target gene sequence is used to determine the identity of the target sequence with the 

wild-type gene sequence/measure its amount, and detect differences between the target 
sequence and the reference wild-type nucleic acid sequence of a g3 5030 polynucleotide. In one 
such design, termed 4L tiled array, is implemented a set of four probes (A, C, G, T), preferably 
15-nucleotide oligomers. In each set of four probes, the perfect complement will hybridize 

30 more strongly than mismatched probes. Consequently, a nucleic acid target of length L is 

scanned for mutations with a tiled array containing 4L probes, the whole probe set containing 
all the possible mutations in the known wild reference sequence. The hybridization signals of 
the 1 5-mer probe set tiled array are perturbed by a single base change in the target sequence. 
As a consequence, there is a characteristic loss of signal or a "footprint" for the probes flanking 

35 a mutation position. This technique was described by Chee et al. in 1996. 
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Consequently, the invention concerns an array of nucleic acid molecules comprising at 
least one polynucleotide described above as probes and primers. Preferably, the invention 
concerns an array of nucleic acid comprising at least two polynucleotides described above as 
probes and primers. 

5 G35030 P ro teins and Polypeptide Fragments: 

The terms " g35030 polypeptides ", are used herein to embrace all of the proteins and 
polypeptides encoded by the respective g3503O polypeptides of the present invention. Forming 
part of the invention are polypeptides encoded by the polynucleotides of the invention, as well 
as fusion polypeptides comprising such polypeptides. The invention embodies proteins from 

10 humans, mammals, primates, non-human primates, and includes isolated or purified g35030 

proteins consisting, consisting essentially, or comprising the sequence of SEQ ID Nos 1 8 to 23. 

It should be noted that the g35030 proteins of the invention also comprise naturally- 
occurring variants of the amino acid sequence of the respective human g35030 proteins. 
The present invention embodies isolated, purified, and recombinant polypeptides 

15 comprising a contiguous span of at least 4 amino acids, preferably at least 6, more preferably at 

least 8 to 1 0 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, 60, 70, 80, 90, 95 
or 1 00 amino acids, to the extent that said span is consistent with the length of a particular SEQ • 
ID, of SEQ ID Nos 18 to 23. In other preferred embodiments the contiguous stretch of amino 
acids comprises the site of a mutation or functional mutation, including a deletion, addition, 

20 swap or truncation of the amino acids in a g3 5 030 protein sequence. In one aspect, the 

contiguous stretch of amino acids comprises deletion, addition, swap or truncation at amino acid 
position 13 of SEQ ID Nos 18 to 23, corresponding to polymorphism described herein by 
biallelic markers 8-126-286 (also referred to as 8-130-143) and 8-155-258. 

The invention further embodies g3 5 030 polypeptides, including isolated and 

25 recombinant polypeptides, encoded respectively by g3 5030 polynucleotides consisting, 

consisting essentially, or comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 
50, 60, 70, 80, 90, 100, 150, 200 or 500 nucleotides of SEQ ID Nos 2 to 17. 

The invention further embodies g35030 polypeptides, including isolated and 
recombinant polypeptides, encoded respectively by g35030 polynucleotides consisting, 

30 consisting essentially, or comprising a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 

50, 60, 70, 80, 90, 100, 150, 200 or 500 nucleotides, to the extent that the length of said span is 
consistent with the nucleotide position range, of SEQ ID No 1, wherein said contiguous span 
comprises at least 1, 2, 3, 4, 5, 7 or 10 of the following nucleotide positions of SEQ ID No 1: 
201 123 to 201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 
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216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 
to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 244561 and 246273 to 247802, or 
the complements thereof. 

g35030~proteins are preferably isolated from human or mammalian tissue samples or 
5 expressed from human or mammalian genes. The g35030 polypeptides of the invention can be 

made using routine expression methods known in the art. The polynucleotide encoding the 
desired polypeptide, is ligated into an expression vector suitable for any convenient host. Both 
eukaryotic and prokaryotic host systems is used in forming recombinant polypeptides, and a 
summary of some of the more common systems. The polypeptide is then isolated from lysed 
10 cells or from the culture medium and purified to the extent needed for its intended use. 

Purification is by any technique known in the art, for example, differential extraction, salt 
fractionation, chromatography, centrifugation, and the like. See, for example, Methods in 
Enzymology for a variety of methods for purifying proteins. 

In addition, shorter protein fragments can be produced by chemical synthesis, 
15 Alternatively the proteins of the invention is extracted from cells or tissues of humans or non- 

human animals. Methods for purifying proteins are known in the art, and include the use of 
detergents or chaotropic agents to disrupt particles followed by differential extraction and 
separation of the polypeptides by ion exchange chromatography, affinity chromatography, 
sedimentation according to density, and gel electrophoresis. 
20 Any g35030 cDNA or fragment thereof, including the respective cDNA sequences of SEQ 

EDNos 2 to 17, is used to express g35030 proteins and polypeptides. The nucleic acid encoding 
the g35030 protein or polypeptide to be expressed is operably linked to a promoter in an expression . 
vector using conventional cloning technology. The g3 5030 insert in the expression vector may 
comprise the full coding sequence for the respective g35030 protein or a portion thereof. For 
25 example, the g3 5030 derived insert may encode a polypeptide comprising at least 1 0 consecutive 

amino acids of the respective g35030 protein of SEQ IDNos 18 to 23. 

The expression vector is any of the mammalian, yeast, insect or bacterial expression 
systems known in the art. Commercially available vectors and expression systems are available 
from a variety of suppliers including Genetics Institute (Cambridge, MA), Stratagene (La Jolla, 
30 California), Promega (Madison, Wisconsin), and Invitrogen (San Diego, California). If desired, to 

enhance expression and facilitate proper protein folding, the codon context and codon pairing of the 
sequence is optimized for the particular expression organism in which the expression vector is 
introduced, as explained by Hatfield, et al., U.S. Patent No. 5,082,767. 

In one embodiment, the entire coding sequence of the g35030 cDNA through the poly A 
35 signal of the cDNA are operably linked to a promoter in the expression vector. Alternatively, if the 
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nucleic acid encoding a portion of the g3503O protein lacks a methionine to serve as the initiation 
site, an initiating methionine can be introduced next to the first codon of the nucleic acid using 
conventional techniques. Similarly, if the insert from the g35030 cDNA lacks a poly A signal, this 
sequence can be" added to the construct by, for example, splicing out the Poly A signal from pSG5 
5 (Stratagene) using Bgll and Sail restriction endonuclease enzymes and incorporating it into the 

mammalian expression vector pXTl (Stratagene). pXTl contains the LTRs and a portion of the 
gag gene from Moloney Murine Leukemia Virus. The position of the LTRs in the construct allow 
efficient stable transfection. The vector includes the Herpes Simplex Thymidine Kinase promoter 
and the selectable neomycin gene. The nucleic acid encoding the g35030 protein or a portion 

1 0 thereof is obtained by PGR from a bacterial vector containing the a nucleotide sequence of an exon 

of a g35030 gene as described herein and in SEQ ID No 1, or from a g35030 cDNA comprising a 
nucleic acid of SEQ ID No 2 to 17 using oligonucleotide primers complementary to the g35030 
nucleic acid or portion thereof and containing restriction endonuclease sequences for Pst I 
incorporated into the 5' primer and Bglll at the 5' end of the corresponding cDNA 3' primer, taking 

1 5 care to ensure that the sequence encoding the g35030 protein or a portion thereof is positioned 

properly with respect to the poly A signal. The purified fragment obtained from the resulting PCR 
reaction is digested with PstI, blunt ended with an exonuclease, digested with Bgl II, purified and 
ligated to pXTl ,.now containing a poly A signal and digested with BgllL 

The ligated product is transfected into mouse NIH 3T3 cells using Lipofectin (Life 

20 Technologies, Inc., Grand Island, New York) under conditions outlined in the product 

specification. Positive transfectants are selected after growing the transfected cells in 600ugfrnl 
G41 8 (Sigma, St. Louis, Missouri). 

Alternatively, the nucleic acids encoding the g35030 protein or a portion thereof is cloned 
into pED6dpc2 (Genetics Institute, Cambridge, MA). The resulting pED6dpc2 constructs is 

25 transfected into a suitable host cell, such as COS 1 cells. Methotrexate resistant cells are selected 

and expanded. 

The above procedures may also be used to express a mutant g35030 protein responsible 
for a detectable phenotype or a portion thereof. 

The expressed proteins are purified using conventional purification techniques such as 
30 ammonium sulfate precipitation or chromatographic separation based on size or charge. The 

protein encoded by the nucleic acid insert may also be purified using standard 
immunochromatography techniques. In such procedures, a solution containing the expressed 
g35030 protein or portion thereof, such as a cell extract, is applied to a column having antibodies 
against the g35030 protein or portion thereof is attached to the chromatography matrix. The 
35 expressed protein is allowed to bind the immunochromatography column. Thereafter, the column 
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is washed to remove non-specifically bound proteins. The specifically bound expressed protein is 
then released from the column and recovered using standard techniques. 

To confirm expression of the §35030 protein or a portion thereof, the proteins expressed 
from host cells containing an expression vector containing an insert encoding the g35030 protein or 
5 a portion thereof can be compared to the proteins expressed in host cells containing the expression 

vector without an insert. The presence of a band in samples from cells containing the expression 
vector with an insert which is absent in samples from cells containing the expression vector without 
an insert indicates that the g35030 protein or a portion thereof is being expressed. Generally, the 
band will have the mobility expected for the g35030 protein or portion thereof However, the band 

1 0 may have a mobility different than that expected as a result of modifications such as glycosylation, 

ubiquitination, or enzymatic cleavage. 

Antibodies capable of specifically recognizing the expressed g35030 protein or a portion 
thereof are described below. 

If antibody production is not possible, the nucleic acids encoding the g35030 protein or a 

1 5 portion thereof is incorporated into expression vectors designed for use in purification schemes 

employing chimeric polypeptides. In such strategies the nucleic acid encoding the g35030 protein 
or a portion thereof is inserted in frame with the gene encoding the other half of the chimera. The 
other half of the chimera is p-globin or a nickel binding polypeptide encoding sequence. A 
chromatography matrix having antibody to p-giobin or nickel attached thereto is then used to purify 

20 the chimeric protein. Protease cleavage sites is engineered between the P-globin gene or the nickel 

binding polypeptide and the g35030 protein or portion thereof Thus, the two polypeptides of the 
chimera is separated from one another by protease digestion. 

One useful expression vector for generating P-globin chimeric proteins is pSG5 
(Stratagene), which encodes rabbit p-globin. Intron II of the rabbit P-globin gene facilitates 

25 splicing of the expressed transcript, and the polyadenylation signal incorporated into the construct 

increases the level of expression. These techniques are well known to those skilled in the art of 
molecular biology. Standard methods are published in methods texts such as Davis et ah, (1986) 
and many of the methods are available from Stratagene, Life Technologies, Inc., or Promega. 
Polypeptide may additionally be produced from the construct using in vitro translation systems 

30 such as the In vitro Express™ Translation Kit (Stratagene). 

Antibodies That Bind g35030 Polypeptides of the Invention 

Any g35030 polypeptide or whole protein may be used to generate antibodies capable 
of specifically binding to an expressed g35O30 protein or fragments thereof. 

For an antibody composition to specifically bind to a g35030 protein, it must 



WO 01/40493 PCT/IB00/O15O7 

51 

demonstrate at least a 5%, 1 0%, 1 5%, 20%, 25%, 50%, or 1 00% greater binding affinity for full 
length g35030 protein than for any full length protein in an EL1SA, RIA, or other antibody- 
based binding assay. For an antibody composition to specifically bind to a variant g35030 
protein, it must "demonstrate at least a 5%, 10%, 15%, 20%, 25%, 50%, or 100% greater binding 
5 affinity for the respective full length variant g35030 protein than for the respective reference 

g35O30 full length protein in an ELISA, RIA, or other antibody-based binding assay. 

One antibody composition of the invention is capable of specifically binding or 
specifically binds to the respective g35030 proteins of SEQ ID Nos 1 8 to 23. Other antibody 
compositions of the invention are capable of specifically binding or specifically bind to a 

10 g35030 protein variant. 

In one embodiment, the invention concerns antibody compositions, either polyclonal or 
monoclonal, capable of selectively binding, or selectively bind to an epitope-containing a 
polypeptide comprising a contiguous span of at least 6 amino acids, preferably at least 8 to 10 
amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids of a g35030 

15 polypeptide. 

The invention also concerns a purified or isolated antibody capable of specifically 
binding to a mutated g35030 protein or to a fragment or variant thereof comprising an epitope 
of the mutated g35030 protein. In another preferred embodiment, the present invention 
concerns an antibody capable of binding to a polypeptide comprising at least 10 consecutive 

20 amino acids of a g35030 protein and including at least one of the amino acids which can be 

encoded by the trait causing mutations. 

An epitope can comprise as few as 3 amino acids in a spatial conformation, which is 
unique to the epitope. Generally an epitope consists of at least 6 such amino acids, and more 
often at least 8-10 such amino acids. In preferred embodiment, antigenic epitopes comprise a 

25 number of amino acids that is any integer between 3 and 50. Fragments which function as 

epitopes may be produced by any conventional means. Epitopes can be determined by a 
Jameson- Wolf antigenic analysis, for example, performed using the computer program 
PROTEAN, using default parameters (Version 4.0 Windows, DNASTAR, Inc., 1228 South Park 
Street Madison, WI. 

30 Predicted antigenic epitopes are shown below. It is pointed out that the immunogenic 

epitope list describe only amino acid residues comprising epitopes predicted to have the highest 
degree of immunogenicity by a particular algorithm. Polypeptides of the present invention that 
are not specifically described as immunogenic are not considered non-antigen ic. This is 
because they may still be antigenic in vivo but merely not recognized as such by the particular 

35 algorithm used. Alternatively, the polypeptides are probably antigenic in vitro using methods 
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such a phage display. Thus, listed below are the amino acid residues comprising only preferred 
epitopes, not a complete list. In fact, all fragments of the polypeptides of the present invention, 
at least 6 amino acids residues in length, are included in the present invention as being useful as 
antigenic epitope. Moreover, listed below are only the critical residues of the epitopes 
' 5 determined by the Jameson-Wolf analysis. Thus, additional flanking residues on either the N- 

terminal, C-terminal, or both N- and C-terminal ends may be added to the sequences listed to 
generate an epitope-bearing portion at least 6 residues in length. Amino acid residues 
comprising other immunogenic epitopes may be determined by algorithms similar to the 
Jameson- Wolf analysis or by in vivo testing for an antigenic response using the methods 
1 0 described herein or those known in the art. 

The epitope-bearing fragments of the present invention preferably comprises 6 to 50 
amino acids (i.e. any integer between 6 and 50, inclusive) of a polypeptide of the present 
invention. Also, included in the present invention are antigenic fragments between the integers 
of 6 and the full length g35030 sequence. 
15 Preferred g35O30 immunogenic epitopes ; 

SEQ ID No. 1 8: Val-9 to Arg-1 1 ; Gly-2 1 to Lys-22 
SEQ ID No. 19: Val-9 to Arg-1 1 ; Pro-45 to Pro-52 
SEQ ID No. 20: Val-9 to Arg-1 1 
SEQ ID No. 21: Val-9 to Arg-1 1 ; Ser-40 to Ser-42 
20 SEQ ID No. 22: Val-9 to Arg-1 1 

SEQ ID No. 23 : Val-9 to Arg-1 1 ; Pro-45 to Glu-49; Ser-57 to Val-60 

Non-human animals or mammals, whether wild-type or transgenic, which express a 
different species of g35030 than the one to which antibody binding is desired, and animals 

25 which do not express g35030 (i.e. a g35O30 knock out animal as described herein) are 

particularly useful for preparing antibodies. g35030 knock out animals will recognize all or 
most of the exposed regions of a g35030 protein as foreign antigens, and therefore produce 
antibodies with a wider array of g35O30 epitopes. Moreover, smaller polypeptides with only 10 
to 30 amino acids may be useful in obtaining specific binding to any one of the g35030 proteins. 

30 In addition, the humoral immune system of animals which produce a species of g3503O that 

resembles the antigenic sequence will preferentially recognize the differences between the 
animal's native g35030 species and the antigen sequence, and produce antibodies to these 
unique sites in the antigen sequence. Such a technique will be particularly useful in obtaining 
antibodies that specifically bind to any one of the g35O30 proteins. 

35 Antibody preparations prepared according to either protocol are useful in quantitative 
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immunoassays which determine concentrations of antigen-bearing substances in biological 
samples; they are also used semi-quantitatively or qualitatively to identify the presence of 
antigen in a biological sample. The antibodies may also be used in therapeutic compositions for 
killing ceils expressing the protein or reducing the levels of the protein in the body. 
5 The antibodies of the invention may be labeled by any one of the radioactive, 

fluorescent or enzymatic labels known in the art. 

Consequently, the invention is also directed to a method for detecting specifically the 
presence of a g35030 polypeptide according to the invention in a biological sample, said 
method comprising the following steps: 
10 a) bringing into contact the biological sample with a polyclonal or monoclonal antibody 

that specifically binds a g35030 polypeptide comprising an amino acid sequence of SEQ CD 
Nos. 18 to 23, or to a peptide fragment or variant thereof; and 

b) detecting the antigen-antibody complex formed. 

The invention also concerns a diagnostic kit for detecting in vitro the presence of a 
1 5 g35030 polypeptide according to the present invention in a biological sample, wherein said kit 

comprises: 

a) a polyclonal or monoclonal antibody that specifically binds a g35030 polypeptide 
comprising an amino acid sequence of SEQ ID Nos 18 to 23, or to a peptide fragment or variant 
thereof, optionally labeled; 
20 b) a reagent allowing the detection of the antigen-antibody complexes formed, said 

reagent carrying optionally a label, or being able to be recognized itself by a labeled reagent, 
more particularly in the case when the above-mentioned monoclonal or polyclonal antibody is 
not labeled by itself. 

The present invention thus relates to antibodies and T-cell antigen receptors (TCR), 
25 which specifically bind the polypeptides, and more specifically, the epitopes of the 

polyepeptides of the present invention, including but not limited to IgG (including IgGl, IgG2, 
IgG3, and IgG4), IgA (including IgA 1 and IgA2), IgD, IgE, or IgM, and IgY. In a preferred 
embodiment the antibodies are human antigen binding antibody fragments of the present 
invention include, but are not limited to, Fab, Fab' F(ab)2 and F(ab')2, Fd, single-chain Fvs 
30 (scFv), single-chain antibodies, disulfi de-linked Fvs (sdFv) and fragments comprising either a 

V L or Vh domain. The antibodies may be from any animal origin including birds and mammals. 
Preferably, the antibodies are human, murine, rabbit, goat, guinea pig, camel, horse, or chicken. 

Antigen-binding antibody fragments, including single-chain antibodies, may comprise 
the variable region(s) alone or in combination with the entire or partial of the following: hinge 
35 region, CHI, CH2, and CH3 domains. Also included in the invention are any combinations of 
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variable region(s) and hinge region, CHI > CH2, and CH3 domains. The present invention 
further includes chimeric, humanized, and human monoclonal and polyclonal antibodies, which 
specifically bind the polypeptides of the present invention. The present invention further 
includes antibodies that are anti-idiotypic to the antibodies of the present invention. 
5 The antibodies of the present invention may be monospecific, bispecific, trispecific or 

have greater multispecificity. Multispecific antibodies may be specific for different epitopes of 
a polypeptide of the present invention or may be specific for both a polypeptide of the present 
invention as well as for heterologous compositions, such as a heterologous polypeptide or solid 
support material. See, e.g., WO 93/17715; WO 92/08802; WO 91/00360; WO 92/05793; Tutt, 

10 A. et al. (3991) J. Immunol. 147:60-69; US Patents 5,573,920, 4,474,893, 5,601,819, 4,714,681, 

4,925,648; Kostelny, SA. et al. (1992) J. Immunol. 148:1547-1553. 

Antibodies of the present invention may be described or specified in terms of the 
epitope(s) or epitope-bearing portion(s) of a polypeptide of the present invention, which are 
recognized or specifically bound by the antibody. In the case of proteins of the present 

15 invention secreted proteins, the antibodies may specifically bind a full-length protein encoded 

by a nucleic acid of the present invention, a mature protein (i.e., the protein generated by 
cleavage of the signal peptide) encoded by a nucleic acid of the present invention, a signal 
peptide encoded by a nucleic acid of the present invention, or any other polypeptide of the 
present invention. Therefore, the epitope(s) or epitope bearing polypeptide portion(s) may be 

20 specified as described herein, e.g., by N-terminal and C-terminal positions, by size in contiguous 

amino acid residues, or otherwise described herein (including the squence listing). Antibodies 
which specifically bind any epitope or polypeptide of the present invention may also be 
excluded as individual species. Therefore, the present invention includes antibodies that 
specifically bind specified polypeptides of the present invention, and allows for the exclusion of 

25 the same. 

Antibodies of the present invention may also be described or specified in terms of their 
cross-reactivity. Antibodies that do not specifically bind any other analog, ortholog, or homolog 
of the polypeptides of the present invention are included. Antibodies that do not bind 
polypeptides with less than 95%, less than 90%, less than 85%, less than 80%, less than 75%, 

30 less than 70%, less than 65%, less than 60%, less than 55%, and less than 50% identity (as 

calculated using methods known in the art and described herein) to a polypeptide of the present 
invention are also included in the present invention. Further included in the present invention 
are antibodies, which only bind polypeptides encoded by polynucleotides, which hybridize to a 
polynucleotide of the present invention under stringent hybridization conditions (as described 

35 herein). Antibodies of the present invention may also be described or specified in terms of their 
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binding affinity. Preferred binding affinities include those with a dissociation constant or Kd 
less than 5X 1 0* 6 M, 1 0*M, 5X 1 0* 7 M, 1 0* 7 M, 5X 1 O^M, 1 (T 8 M, 5X 1 (r 9 M, 1 0* 9 M, 5X 1 0' 10 M, 1 0* 
,0 M, 5X10* n M, 10- n M, 5X10' l2 M> 10' t2 M, 5Xl<T n M, I0' u M, 5X10* i4 M s l(r I4 M, 5X1(T 15 M, and 
10' 15 M, 

5 Antibodies of the present invention have uses that include, but are not limited to, 

methods known in the art to purify, detect, and target the polypeptides of the present invention 
including both in vitro and in vivo diagnostic and therapeutic methods. For example, the 
antibodies have use in immunoassays for qualitatively and quantitatively measuring levels of the 
polypeptides of the present invention in biological samples. See, e.g., Harlow et al., 

1 0 ANTIBODIES: A LABORATORY MANUAL, (Cold Spring Harbor Laboratory Press, 2nd ed. 

1988) (incorporated by reference in the entirety). 

The antibodies of the present invention may be used either alone or in combination with 
other compositions. The antibodies may further be recombinantly fused to a heterologous 
polypeptide at the N- or C-terminus or chemically conjugated (including covalent and non- 

1 5 covalent conjugations) to polypeptides or other compositions. For example, antibodies of the 

present invention may be recombinantly fused or conjugated to molecules useful as labels in 
detection assays and effector molecules such as heterologous polypeptides, drugs, or toxins. See, 
e.g., WO 92/08495; WO 91/14438; WO 89/12624; US Patent 5,314,995; and EP 0 396 387. 

The antibodies of the present invention may be prepared by any suitable method known 

20 in the art. For example, a polypeptide of the present invention or an antigenic fragment thereof 

can be administered to an animal in order to induce the production of sera containing polyclonal 
antibodies. The term "monoclonal antibody" is not limited to antibodies produced through 
hybridoma technology. The term "antibody" refers to a polypeptide or group of polypeptides 
which are comprised of at least one binding domain, where a binding domain is formed from 

25 the folding of variable domains of an antibody molecule to form three-dimensional binding 

spaces with an internal surface shape and charge distribution complementary to the features of 
an antigenic determinant of an antigen., which allows an immunological reaction with the 
antigen. The term "monoclonal antibody" refers to an antibody that is derived from a single 
clone, including eukaryotic, prokaryotic, or phage clone, and not the method by which it is 

30 produced. Monoclonal antibodies can be prepared using a wide variety of techniques known in 

the art including the use of hybridoma, recombinant, and phage display technology. 

Hybridoma techniques include those known in the art (See, e.g., Harlow et al. (1998); 
Hammerling, et al. (1981) (said references incorporated by reference in their entireties). Fab 
and F(ab')2 fragments may be produced, for example, from hybridoma-produced antibodies by 

35 proteolytic cleavage, using enzymes such as papain (to produce Fab fragments) or pepsin (to 
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produce F(ab')2 fragments). 

Alternatively, antibodies of the present invention can be produced through the 
application of recombinant DNA technology or through synthetic chemistry using methods 
known in the art. For example, the antibodies of the present invention can be prepared using 
5 various phage display methods known in the art. In phage display methods, functional antibody 

domains are displayed on the surface of a phage particle, which carries polynucleotide 
sequences encoding them. Phage with a desired binding property are selected from a repertoire 
or combinatorial antibody library (e.g. human or murine) by selecting directly with antigen, 
typically antigen bound or captured to a solid surface or bead. Phage used in these methods are 

10 typically filamentous phage including fd and M13 with Fab, Fv or disulfide stabilized Fv 

antibody domains recombinantly fused to either the phage gene III or gene VIII protein. 
Examples of phage display methods that can be used to make the antibodies of the present 
invention include those disclosed in Brinkman U. et al. (1995); Ames, R.S. et al. (1995); 
Kettleborough, C.A. etal. (1994); Persic, L. etal. (1997); Burton, D.R.et al. (1994); 

15 PCT/GB91/01134; WO 90/02809; WO 91/10737; WO92/01047; WO 92/1 86 19; WO 93/1 1236; 

WO 95/15982; WO 95/20401; and US Patents 5,698,426, 5,223,409, 5,403,484, 5,580,717, 
5,427,908, 5,750,753, 5,821,047, 5,571,698, 5,427,908, 5,516,637, 5,780,225, 5,658,727 and 
5,733,743 (said references incorporated by reference in their entireties). 

As described in the above references, after phage selection, the antibody coding regions 

20 from the phage can be isolated and used to generate whole antibodies, including human 

antibodies, or any other desired antigen binding fragment, and expressed in any desired host 
including mammalian cells, insect cells, plant cells, yeast, and bacteria. For example, 
techniques to recombinantly produce Fab, Fab' F(ab)2 and F(ab')2 fragments can also be 
employed using methods known in the art such as those disclosed in WO 92/22324; Mullinax, 

25 R.L. et al. (1992); and Sawai, H. et al. (1995); and Better, M. et al. (1988) (said references 

incorporated by reference in their entireties). 

Examples of techniques which can be used to produce single-chain Fvs and antibodies 
include those described in U.S. Patents 4,946,778 and 5,258,498; Huston et al. (1991); Shu, L. et 
al. (1993); and Skerra, A, et al. (1988). For some uses, including in vivo use of antibodies in 

30 humans and in vitro detection assays, it may be preferable to use chimeric, humanized, or 

human antibodies. Methods for producing chimeric antibodies are known in the art. See e.g., 
Morrison (1985); Oi etal, (1986); Gillies, S.D. et al. (1989); and US Patent 5,807,715. 
Antibodies can be humanized using a variety of techniques including CDR-grafting (EP 0 239 
400; WO 91/09967; US Patent 5,530,101; and 5,585,089), veneering or resurfacing (EP 0 592 

35 106; EP 0 519 596; Padlan E.A., (1991); Studnicka G.M. et al. (1994); Roguska M.A. et al. 
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(1994), and chain shuffling (US Patent 5,565,332). Human antibodies can be made by a variety 
of methods known in the art including phage display methods described above. See also, US 
Patents 4,444,887, 4,716,11 1, 5,545,806, and 5,814,318; WO 98/46645; WO 98/50433; WO 
98/24893; WO96/34096; WO 96/33735; and WO 91/10741 (said references incorporated by 
5 reference in their entireties). 

Further included in the present invention are antibodies recombinantly fused or 
chemically conjugated (including both covalently and non-covalently conjugations) to a 
polypeptide of the present invention. The antibodies may be specific for antigens other than 
polypeptides of the present invention. For example, antibodies may be used to target the 

10 polypeptides of the present invention to particular ceil types, either in vitro or in vivo, by fusing 

or conjugating the polypeptides of the present invention to antibodies specific for particular cell 
surface receptors. Antibodies fused or conjugated to the polypeptides of the present invention 
may also be used in in vitro immunoassays and purification methods using methods known in 
the art. See e.g., Harbor et al, supra and WO 93/2 1 232; EP 0 439 095; Naramura, M. et al. 

15 (1994); US Patent 5,474,981; Gillies, S O. et al. (1992); Fell, H.P. et al (1991) (said references 

incorporated by reference in their entireties). 

The present invention further includes compositions comprising the polypeptides of the 
present invention fused or conjugated to antibody domains other than the variable regions, for 
example, the polypeptides of the present invention may be fused or conjugated to an antibody Fc 

20 region, or portion thereof. The antibody portion fused to a polypeptide of the present invention 

may comprise the hinge region, CHI domain, CH2 domain, and CH3 domain or any 
combination of whole domains or portions thereof. The polypeptides of the present invention 
may be fused or conjugated to the above antibody portions to increase the in vivo half-life of the 
polypeptides or for use in immunoassays using methods known in the art. The polypeptides 

25 may also be fused or conjugated to the above antibody portions to form multimers. For 

example, Fc portions fused to the polypeptides of the present invention can form dimers through 
disulfide bonding between the Fc portions. Higher muitimeric forms can be made by fusing the 
polypeptides to portions of IgA and IgM. Methods for fusing or conjugating the polypeptides of 
the present invention to antibody portions are known in the art. See e.g., US Patents 5,336,603, 

30 5,622,929, 5,359,046, 5,349,053, 5,447,851, 5,1 12,946; EP 0 307 434, EP 0 367 166; WO 

96/04388, WO 91/06570; Ashkenazi, A. et al. (1991); Zheng, XX et al. (1995); and Vil, H. et 
al. (1992) (said references incorporated by reference in their entireties). 

The invention further relates to antibodies that act as agonists or antagonists of the 
polypeptides of the present invention. For example, the present invention includes antibodies 

35 that disrupt the receptor/Hgand interactions with the polypeptides of the invention either 
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partially or fully. Included are both receptor-specific antibodies and ligand-specific antibodies. 
Included are receptor-specific antibodies, which do not prevent ligand binding but prevent 
receptor activation. Receptor activation (i.e., signaling) may be determined by techniques 
described herein or otherwise known in the art. Also include are receptor-specific antibodies 

5 which both prevent ligand binding and receptor activation. Likewise, included are neutralizing 

antibodies that bind the ligand and prevent binding of the ligand to the receptor, as well as 
antibodies that bind the ligand, thereby preventing receptor activation, but do not prevent the 
ligand from binding the receptor. Further included are antibodies that activate the receptor. 
These antibodies may act as agonists for either all or less than all of the biological activities 

10 affected by Iigand-mediated receptor activation. The antibodies may be specified as agonists or 

antagonists for biological activities comprising specific activities disclosed herein. The above 
antibody agonists can be made using methods known in the art. See e.g., WO 96/40281; US 
Patent 5,81 1,097; Deng, B. et al. (1998); Chen, Z. et al. (1998); Harrop, J.A. et al. (1998); Zhu, 
Z. et al. (1998); Yoon, D.Y. et al. (1998); Prat, M. et al. (1998); Pitard, V. et al. (1997); 

15 Liautard, J. et al. (1997); Carlson, N.G. et al. (1997); Taryman, R.E. et al. (1995); Mulier, Y.A. 

et al. (1998); Bartunek, P. et al. (1996) (said references incorporated by reference in their 
entireties). 

As discussed above, antibodies of the polypeptides of the invention can, in turn, be 
utilized to generate anti-idiotypic antibodies that "mimic" polypeptides of the invention using 

20 techniques well known to those skilled in the art. See, e.g. Greenspan and Bona, (1989); 

Nissinoff, (1991), For example, antibodies which bind to and competitively inhibit polypeptide 
multimerization or binding of a polypeptide of the invention to ligand can be used to generate 
antiidiotypes that "mimic" the polypeptide multimerization or binding domain and, as a 
consequence^ bind to and neutralize polypeptide or its ligand. Such neutralization anti-idiotypic 

25 antibodies can be used to bind a polypeptide of the invention or to bind its ligands/receptors, 

and therby block its biological activity. 

Biallelic markers of the inventions 

Advantages of the biallelic markers of the present invention 

30 The biallelic marker of the inventions of the present invention offer a number of 

important advantages over other genetic markers such as RFLP (Restriction fragment length 
polymorphism) and VNTR (Variable Number of Tandem Repeats) markers. 

The first generation of markers, were RFLPs, which are variations that modify the 
length of a restriction fragment. But methods used to identify and to type RFLPs are relatively 

35 wasteful of materials, effort, and time. The second generation of genetic markers were YNTRs, 
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which can be categorized as either minisatellites or microsatellites. Minisatellites are tandem ly 
repeated DNA sequences present in units of 5-50 repeats which are distributed along regions of 
the human chromosomes ranging from 0.1 to 20 kilobases in length. Since they present many 
possible alleles, their informative content is very high. Minisatellites are scored by performing 
5 Southern blots to identify the number of tandem repeats present in a nucleic acid sample from 

4 

the individual being tested. However, there are only 10 potential VNTRsthat can be typed by 
Southern blotting. Moreover, both RFLP and VNTR markers are costly and time-consuming to 
develop and assay in large numbers. 

Single nucleotide polymorphism or biallelic markers can be used in the same manner as 
10 RFLPs and VNTRs but offer several advantages. Single nucleotide polymorphisms are densely 

spaced in the human genome and represent the most frequent type of variation. An estimated 

number of more than 10^ sites are scattered along the 3x1 0 9 base pairs of the human genome. 
Therefore, single nucleotide polymorphism occur at a greater frequency and with greater 
uniformity than RFLP or VNTR markers which means that there is a greater probability that 

1 5 such a marker will be found in close proximity to a genetic locus of interest Single nucleotide 

polymorphisms are less variable than VNTR markers but are mutational ly more stable. 

Also, the different forms of a characterized single nucleotide polymorphism, such as the 
biallelic markers of the present invention, are often easier to distinguish and can therefore be 
typed easily on a routine basis. Biallelic markers have single nucleotide based alleles and they 

20 have only two common alleles, which allows highly parallel detection and automated scoring. 

The biallelic markers of the present invention offer the possibility of rapid, high-throughput 
genotyping of a large number of individuals. 

Biallelic markers are densely spaced in the genome, sufficiently informative and can be 
assayed in large numbers. The combined effects of these advantages make biallelic markers 

25 extremely valuable in genetic studies. Biallelic markers can be used in linkage studies in 

families, in allele sharing methods, in linkage disequilibrium studies in populations, in 
association studies of case-control populations, An important aspect of the present invention is 
that biallelic markers allow association studies to be performed to identify genes involved in 
complex traits. Association studies examine the frequency of marker alleles in unrelated case- 

30 and control-populations and are generally employed in the detection of polygenic or sporadic 

traits. Association studies may be conducted within the general population and are not limited 
to studies performed on related individuals in affected families (linkage studies). Biallelic 
markers in different genes can be screened in parallel for direct association with disease or 
response to a treatment. This multiple gene approach is a powerful too! for a variety of human 

35 genetic studies as it provides the necessary statistical power to examine the synergistic effect of 
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multiple genetic factors on a particular phenotype, drug response, sporadic trait, or disease state 
with a complex genetic etiology. 

Biallelic Markers And Polynucleotides Comprising Them 
5 Polynucleotides of the present invention 

In one aspect, the invention concerns biallelic markers associated with schizophrenia. 
The invention comprises chromosome 13q31-q33-related biallelic markers and g35030-related 
biallelic markers. The markers are generally referred to herein as Al , A2, A3 and so on. The 
biallelic markers of the invention comprise the biallelic markers designated Al to A127 in 
10 Table 5. Also included are biallelic markers in linkage disequilibrium with the biallelic markers 

of the invention. 

The polynucleotide of the invention may consist of, consist essentially of, or comprise a 
contiguous span of nucleotides of a sequence from any of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 
and 79 to 132 as well as sequences which are complementary thereto ("complements thereof 1 ). 

15 The "contiguous span" may be at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 

500, 1000 or 2000 nucleotides in length, to the extent that a contiguous span of these lengths is 
consistent with the lengths of the particular Sequence ID. 

The present invention encompasses polynucleotides for use as primers and probes in the 
methods of the invention. These polynucleotides may consist of, consist essentially of, or 

20 comprise a contiguous span of nucleotides of a sequence from any of SEQ ID Nos. 1 to 1 7, 24, 

25, 28, 74 and 79 to 1 32 as well as sequences which are complementary thereto ("complements 
thereof 1 ). The "contiguous span" may be at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 
100, 250, 500 , 1000 or 2000 nucleotides in length, to the extent that a contiguous span of these 
lengths is consistent with the lengths of the particular Sequence ID. It should be noted that the 

25 polynucleotides of the present invention are not limited to having the exact flanking sequences 

surrounding the polymorphic bases which, are enumerated in the Sequence Listing. Rather, it 
will be appreciated that the flanking sequences surrounding the biallelic markers and other 
polymorphisms of the invention, or any of the primers of probes of the invention which, are 
more distant from the markers, may be lengthened or shortened to any extent compatible with 

30 their intended use and the present invention specifically contemplates such sequences. It will be 

appreciated that the polynucleotides of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 may 
be of any length compatible with their intended use. Also the flanking regions outside of the 
contiguous span need not be homologous to native flanking sequences which actually occur in 
human subjects. The addition of any nucleotide sequence, which is compatible with the 

35 nucleotides intended use is specifically contemplated. The contiguous span may optionally 
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include the biallelic markers of the invention in said sequence. Biailelic markers generally 
comprise a polymorphism at one single base position. Each biallelic marker therefore 
corresponds to two forms of a polynucleotide sequence which, when compared with one 
another, present a nucleotide modification at one position. Usually, the nucleotide modification 
5 involves the substitution of one nucleotide for another. Optionally allele 1 or allele 2 of the 

biallelic markers disclosed in Table 5 may be specified as being present at the biallelic marker 
of the invention. Preferred polynucleotides may consist of, consist essentially of, or comprise a 
contiguous span of nucleotides of a sequence from SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 
to 132 as well as sequences which are complementary thereto. The "contiguous span" may be 

10 at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 500, 1000 or 2000 nucleotides in 

length, to the extent that a contiguous span of these lengths is consistent with the lengths of the 
particular Sequence ID. 

A preferred probe or primer comprises a nucleic acid comprising a polynucleotide 
selected from the group of the nucleotide sequences of PI to P127 and the complementary 

15 sequence thereto, Bl to Bl 14, CI to CI 14, Dl to Dl 27, El toE127. 

The invention also relates to polynucleotides that hybridize, under conditions of high or 
intermediate stringency, to a polynucleotide of any of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 
79 to 132 as well as sequences, which are complementary thereto. Preferably such 
polynucleotides are at least 20, 25, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000 nucleotides 

20 in length, to the extent that a polynucleotide of these lengths is consistent with the lengths of the 

particular Sequence ID. Preferred polynucleotides comprise a polymorphism of the invention. 
Optionally either allele 1 or allele 2 of the polymorphism disclosed in Table 5 may be specified 
as being present at the polymorphism of the invention* Particularly preferred polynucleotides 
comprise a biallelic marker of the invention. Conditions of high stringency are further 

25 described herein. 

The primers of the present invention may be designed from the disclosed sequences for 
any method known in the art. A preferred set of primers is fashioned such that the 3' end of the 
contiguous span of identity with the sequences of any of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 
and 79 to 132 is present at the 3' end of the primer. Such a configuration allows the 3' end of 

30 the primer to hybridize to a selected nucleic acid sequence and dramatically increases the 

efficiency of the primer for amplification or sequencing reactions. In a preferred set of primers 
the contiguous span is found in one of the sequences described in Table 4. Allele specific 
primers may be designed such that a biallelic marker or other polymorphism of the invention is 
at the 3 1 end of the contiguous span and the contiguous span is present at the 3* end of the 

35 primer. Such allele specific primers tend to selectively prime an amplification or sequencing 
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reaction so long as they are used with a nucleic acid sample that contains one of the two alleles 
present at said marker. The 3 r end of primer of the invention may be located within or at least 
2, 4, 6, 8, 10, 12, 15, 18, 20, 25, 50, 100, 250, 500, or 1000 nucleotides upstream of a biallelic 
marker of the invention in said sequence or at any other location which is appropriate for their 
5 intended use in sequencing, amplification or the location of novel sequences or markers. 

Primers with their 3' ends located 1 nucleotide upstream of an biallelic marker of the invention 
have a special utility as microsequencing assays. Preferred microsequencing primers are 
described in Table 6. 

The probes of the present invention may be designed from the disclosed sequences for 

10 any method known in the art, particularly methods which allow for testing if a particular 

sequence or marker disclosed herein is present. A preferred set of probes may be designed for 
use in the hybridization assays of the invention in any manner known in the art such that they 
selectively bind to one allele of a biallelic marker or other polymorphism, but not the other 
under any particular set of assay conditions. Preferred hybridization probes may consists of, 

15 consist essentially of, or comprise a contiguous span which ranges in length from 8, 10, 12, 1 5, 

18 or 20 to 25, 35, 40, 50, 60, 70, or 80 nucleotides, or be specified as being 12, 15, 18, 20, 25, 
35, 40, or 50 nucleotides in length and including an biallelic marker or other polymorphism of 
the invention in said sequence. In a preferred embodiment, either of allele 1 or 2 disclosed in 
Table 5 may be specified as being present at the biallelic marker site. In another preferred 

20 embodiment, said biallelic marker may be within 6, 5, 4, 3, 2, or 1 nucleotides of the center of 

the hybridization probe or at the center of said probe. 

In one embodiment the invention encompasses isolated, purified, and recombinant 
polynucleotides comprising, consisting of, or consisting essentially of a contiguous span of 8 to 
50 nucleotides of any one of SEQ ID Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 132 and the 

25 complement thereof, wherein said span includes a biallelic marker of the invention, a 

chromosome 13q3 l-q33-related biallelic marker or g35030-related biallelic marker in said 
sequence; optionally, wherein said polymorphism or biallelic marker selected from the group 
consisting of Al to A127, and the complements thereof, or optionally the biallelic markers in 
linkage disequilibrium therewith; optionally, wherein said biallelic marker is selected from the 

30 group consisting of A13 to A18, A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, 

A76 and A123 ; optionally, wherein said biallelic marker is selected from the group consisting 
of A13 to A65, and the complements thereof, or optionally the biallelic markers in linkage 
disequilibrium therewith; optionally, wherein said contiguous span is 18 to 35 nucleotides in 
length and said biallelic marker is within 4 nucleotides of the center of said polynucleotide; 

35 optionally, wherein said polynucleotide consists of said contiguous span and said contiguous 
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span is 25 nucleotides in length and said biallelic marker is at the center of said polynucleotide; 
optionally, wherein the 3' end of said contiguous span is present at the 3 1 end of said 
polynucleotide; and optionally, wherein the 3' end of said contiguous span is located at the 3' 
end of said polynucleotide and said biallelic marker is present at the 3* end of said 

5 polynucleotide. In a preferred embodiment, said probes comprise, consists of, or consists 

essentially of a sequence selected from the following sequences: PI to P127 and the 
complementary sequences thereto. 

In another embodiment the invention encompasses isolated, purified and recombinant 
polynucleotides comprising, consisting of, or consisting essentially of a contiguous span of 8 to 

10 50 nucleotides of any one of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132, or the 

complement thereof, wherein the 3' end of said contiguous span is located at the 3* end of said 
polynucleotide, and wherein the 3' end of said polynucleotide is located within 20 nucleotides 
upstream of a polymorphism of the invention, chromosome 13q31-q33-related biallelic marker, 
or g35030-related biallelic marker in said sequence; optionally, wherein said polymorphism or 

15 biallelic marker is selected from the group consisting of A 1 to A127, and the complements 

thereof, or optionally the biallelic markers in linkage disequilibrium therewith; optionally, 
wherein said biallelic marker is selected from the group consisting of A13 to Al 8, A20 to A46, 
A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 , and the complements thereof, 
or optionally the biallelic markers in linkage disequilibrium therewith; optionally, wherein said 

20 biallelic marker is selected from the group consisting of A13 to A65; optionally, wherein the 3' 

end of said polynucleotide is located 1 nucleotide upstream of said polymorphism or biallelic 
marker; and optionally, wherein said polynucleotide comprises, consists of, or consists 
essentially of a sequence selected from the following sequences: Dl to D127 and El to E127. 
In a further embodiment, the invention encompasses isolated, purified, or recombinant 

25 polynucleotides comprising, consisting of, or consisting essentially of a sequence selected from 

the following sequences: B 1 to B 1 14 and C 1 to C 1 1 4. 

In an additional embodiment, the invention encompasses polynucleotides for use in 
hybridization assays, sequencing assays, and enzyme-based mismatch detection assays for 
determining the identity of the nucleotide at a polymorphism or biallelic marker in any of SEQ 

30 ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the complement thereof, as well as 

polynucleotides for use in amplifying segments of nucleotides comprising a polymophism or 
biallelic marker of the invention in any of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132 or 
the complement thereof; optionally, wherein said polymorphism or biallelic marker is selected 
from the group consisting of A 1 to A 127, and the complements thereof, or optionally the 

35 biallelic markers in linkage disequilibrium therewith; optionally, wherein said biallelic marker 
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is selected from the group consisting of Al 3 to Al 8, A20 to A46, A49 to A52, A55, A57, A59 
to A63 3 A72 to A73, A76 and A 123 , and the complements thereof, or optionally the biallelic 
markers in linkage disequilibrium therewith; optionally, wherein said biallelic marker is 
selected from the group consisting of A13 to A65, and the complements thereof, or optionally 
5 the biallelic markers in linkage disequilibrium therewith. 

These arrays may generally be produced using mechanical synthesis methods or light 
directed synthesis methods, which incorporate a combination of photolithographic methods and 
solid phase oligonucleotide synthesis (Fodor et al., Science, 251:767-777, 1991). The 
immobilization of arrays of oligonucleotides on solid supports has been rendered possible by 

10 the development of a technology generally identified as "Very Large Scale Immobilized 

Polymer Synthesis" (VLSIPS™) in which, typically, probes are immobilized in a high density 
array on a solid surface of a chip. Examples of VLSIPS™ technologies are provided in US 
Patents 5,143,854 and 5,412,087 and in PCT Publications WO 90/15070, WO 92/10092 and 
WO 95/1 1 995, which describe methods for forming oligonucleotide arrays through techniques 

15 such as light-directed synthesis technique. In designing strategies aimed at providing arrays of 

nucleotides immobilized on solid supports, further presentation strategies were developed to 
order and display the oligonucleotide arrays on the chips in an attempt to maximize 
hybridization patterns and sequence information. Examples of such presentation strategies are 
disclosed in PCT Publications WO 94/12305, WO 94/1 1530, WO 97/29212 and WO 97/31256. 

20 Oligonucleotide arrays may comprise at least one of the sequences selected from the 

group consisting of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132; and the sequences 
complementary thereto or a fragment thereof of at least 8, 10, 12, 15,18, 20, 25, 35, 40, 50, 70, 
80, 100, 250, 500 , 1000 or 2000 consecutive nucleotides, to the extent that fragments of these 
lengths is consistent with the lengths of the particular Sequence ID, for determining whether a 

25 sample contains one or more alleles of the biallelic markers of the present invention. 

Oligonucleotide arrays may also comprise at least one of the sequences selected from the group 
consisting of, SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132; and the sequences 
complementary thereto or a fragment thereof of at least 8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 
80, 100, 250, 500 , 1 000 or 2000 consecutive nucleotides, to the extent that fragments of these 

30 lengths is consistent with the lengths of the particular Sequence ID, for amplifying one or more 

alleles of the biallelic markers of Table 5. In other embodiments, arrays may also comprise at 
least one of the sequences selected from the group consisting of SEQ ID Nos. 1 to 17, 24, 25, 
28, 74 and 79 to 132; and the sequences complementary thereto or a fragment thereof of at least 
8, 10, 12, 15, 18, 20, 25, 35, 40, 50, 70, 80, 100, 250, 500 , 1000 or 2000 consecutive 

35 nucleotides, to the extent that fragments of these lengths is consistent with the lengths of the 
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particular Sequence ID, for conducting microsequencing analyses to determine whether a 
sample contains one or more alleles of the biallelic markers of the invention. In still further 
embodiments, the oligonucleotide array may comprise at least one of the sequences selecting 
from the groupVonsisting of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132; and the 
5 sequences complementary thereto or a fragment thereof of at least 8, 10, 12, IS, 18, 20, 25,35, 

40, 50, 70, 80, 100, 250, 500 , 1000 or 2000 nucleotides in length, to the extent that fragments 
of these lengths is consistent with the lengths of the particular Sequence ID, for determining 
whether a sample contains one or more alleles of the polymorphisms and biallelic markers of 
the present invention. 

10 A further object of the invention relates to an array of nucleic acid sequences 

comprising either at least one of the sequences selected from the group consisting of PI to 
P127,B1 toB114,Cl toC114, Dl to DL27E1 to El 27 or the sequences complementary thereto 
or a fragment thereof of at least 8, 10, 12, 1 5, 18, 20, 25, 30, or 40 consecutive nucleotides 
thereof, or at least one sequence comprising at least 1, 2, 3, 4, 5, 10, 20 biallelic markers 

1 5 selected from the group consisting of A 1 to A 1 27 or the complements thereof. The invention 

also pertains to an array of nucleic acid sequences comprising either at least 1, 2, 3, 4, 5, 10, 20 
of the sequences selected from the group consisting of PI to P127, Bl to Bl 14, CI to CI 14, Dl 
to D127, El to E127 or the sequences complementary thereto or a fragment thereof of at least 8 
consecutive nucleotides thereof, or at least two sequences comprising a biallelic marker selected 

20 from the group consisting of Al to A 1 27 or the complements thereto. In preferred 

embodiments, said biallelic markers are selected from the group consisting of A13 to A18, A20 
to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 . In preferred 
embodiments, said probes, amplification primers and microsequencing primers correspond in 
Table 6 to said biallelic markers. 

25 The present invention also encompasses diagnostic kits comprising one or more 

polynucleotides of the invention, optionally with a portion or all of the necessary reagents and 
instructions for genotyping a test subject by determining the identity of a nucleotide at an 
biallelic marker of the invention. The polynucleotides of a kit may optionally be attached to a 
solid support, or be part of an array or addressable array of polynucleotides. The kit may 

30 provide for the determination of the identity of the nucleotide at a marker position by any 

method known in the art including, but not limited to, a sequencing assay method, a 
microsequencing assay method, a hybridization assay method, or enzyme-based mismatch 
detection assay. Optionally such a kit may include instructions for scoring the results of the 
determination with respect to the test subjects' predisposition to schizophrenia, or likely 

35 response to an agent acting on schizophrenia, or chances of suffering from side effects to an 
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agent acting on schizophrenia. 

Optionally, in any embodiments described herein, a biallelic marker of the invention 
may comprise: 

(a) a biallelic marker selected from the group consisting of g35030-related markers A13 
5 to A65; (b) a biallelic marker selected from the group consisting of chromosome 13q3 l-q33- 

related-biallelic markers Al 3 to A18, A20 to A46, A49 to A52 ? A55, A57, A59 to A63, A72 to 
A73, A76 and A123; or (c) a biallelic marker selected from the group consisting of chromosome 
13q31-q33-related-biallelic markers A13 to A18, A20 to A47, A49 to A52, A55, A57, A59 to 
A63. Furthermore, any of the embodiments of the invention may specifically exclude one or 
10 more of the biallelic markers selected from biallelic markers Al to A127. 

Methods For DeNovo Identification Of Biallelic Markers 

Any of a variety of methods can be used to screen a genomic fragment for single 
nucleotide polymorphisms such as differential hybridization with oligonucleotide probes, 
15 detection of changes in the mobility measured by gel electrophoresis or direct sequencing of the 

amplified nucleic acid. A preferred method for identifying biallelic markers involves 
comparative sequencing of genomic DNA fragments from an appropriate number of unrelated 
individuals. 

In a first embodiment, DNA samples from unrelated individuals are pooled together, 

20 following which the genomic DNA of interest is amplified and sequenced. The nucleotide 

sequences thus obtained are then analyzed to identify significant polymorphisms. One of the 
major advantages of this method resides in the fact that the pooling of the DNA samples 
substantially reduces the number of DNA amplification reactions and sequencing reactions, 
which must be carried out. Moreover, this method is sufficiently sensitive so that a biallelic 

25 marker obtained thereby usually demonstrates a sufficient frequency of its less common allele 

to be useful in conducting association studies. Usually, the frequency of the least common 
allele of a biallelic marker identified by this method is at least 1 0%. 

In a second embodiment, the DNA samples are not pooled and are therefore amplified 
and sequenced individually. This method is usually preferred when biallelic markers need to be 

30 identified in order to perform association studies within candidate genes. Preferably, highly 

relevant gene regions such as promoter regions or exon regions may be screened for biallelic 
markers. A biallelic marker obtained using this method may show a lower degree of 
informativeness for conducting association studies, e.g. if the frequency of its less frequent 
allele may be less than about 10%. Such a biallelic marker will however be sufficiently 

35 informative to conduct association studies and it will further be appreciated that including less 
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informative biallelic markers in the genetic analysis studies of the present invention, may allow 
in some cases the direct identification of causal mutations, which may, depending on their 
penetrance, be rare mutations. 

The following is a description of the various parameters of a preferred method used by 
5 the inventors for the identification of the biallelic markers of the present invention. 

Genomic DNA samples 

The genomic DNA samples from which the biallelic markers of the present invention 
are generated are preferably obtained from unrelated individuals corresponding to a 
heterogeneous population of known ethnic background. The number of individuals from whom 
10 DNA samples are obtained can vary substantially, preferably from about 10 to about 1000, more 

preferably from about 50 to about 200 individuals. Usually, DNA samples are collected from at 
least about 1 00 individuals in order to have sufficient polymorphic diversity in a given 
population to identify as many markers as possible and to generate statistically significant 
results* 

15 As for the source of the genomic DNA to be subjected to analysis, any test sample can 

be foreseen without any particular limitation. These test samples include biological samples, 
which can be tested by the methods of the present invention described herein, and include 
human and animal body fluids such as whole blood, serum, plasma, cerebrospinal fluid, urine, 
lymph fluids, and various external secretions of the respiratory, intestinal and genitourinary 

20 tracts, tears, saliva, milk, white blood cells, myelomas and the like; biological fluids such as cell 

culture supernatants; fixed tissue specimens including tumor and non-tumor tissue and lymph 
node tissues; bone marrow aspirates and fixed cell specimens. The preferred source of genomic 
DNA used in the present invention is from peripheral venous blood of each donor. Techniques 
to prepare genomic DNA from biological samples are well known to the skilled technician. 

25 Details of a preferred embodiment are provided in Example 1 . The person skilled in the art can 

choose to amplify pooled or unpooled DNA samples. 
DNA Amplification 

The identification of biallelic markers in a sample of genomic DNA may be facilitated 
through the use of DNA amplification methods. DNA samples can be pooled or unpooled for 
30 the amplification step. DNA amplification techniques are well known to those skilled in the art. 

Various methods to amplify DNA fragments carrying biallelic markers are further described 
hereinafter herein. The PGR technology is the preferred amplification technique used to 
identify new biallelic markers. 

In a first embodiment, biallelic markers are identified using genomic sequence 
35 information generated by the inventors. Genomic DNA fragments, such as the inserts of the 
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BAC clones described above, are sequenced and used to design primers for the amplification of 
500 bp fragments. These 500 bp fragments are amplified from genomic DNA and are scanned 
for biallelic markers. Primers may be designed using the OSP software (Hillier L. and Green 
P., 1991). All primers may contain, upstream of the specific target bases, a common 

5 oligonucleotide tail that serves as a sequencing primer. Those skilled in the art are familiar with 

primer extensions, which can be used for these purposes. 

Jn another embodiment of the invention, genomic sequences of candidate genes are 
available in public databases allowing direct screening for biallelic markers. Preferred primers, 
useful for the amplification of genomic sequences encoding the candidate genes, focus on 

10 promoters, exons and splice sites of the genes. A biallelic marker present in these functional 

regions of the gene have a higher probability to be a causal mutation. 

Sequencing Of Amplified Genomic DNA And Identification Of Single Nucleotide 
Polymorphisms 

The amplification products generated as described above, are then sequenced using any 

1 5 method known and available to the skilled technician. Methods for sequencing DNA using 

either the dideoxy-mediated method (Sanger method) or the Maxam-Gilbert method are widely 
known to those of ordinary skill in the art. Such methods are for example disclosed in Maniatis 
et al. (Molecular Cloning, A Laboratory Manual, Cold Spring Harbor Press, Second Edition, . 
1989). Alternative approaches include hybridization to high-density DNA probe arrays as 

20 described in Chee et al. (Science 274, 610, 1 996). 

Preferably, the amplified DNA is subjected to automated dideoxy terminator 
sequencing reactions using a dye-primer cycle sequencing protocol. The products of the 
sequencing reactions are run on sequencing gels and the sequences are determined using gel 
image analysis. The polymorphism search is based on the presence of superimposed peaks in 

25 the electrophoresis pattern resulting from different bases occurring at the same position. 

Because each dideoxy terminator is labeled with a different fluorescent molecule, the two peaks 
corresponding to a biallelic site present distinct colors corresponding to two different 
nucleotides at the same position on the sequence. However, the presence of two peaks can be 
an artifact due to background noise. To exclude such an artifact, the two DNA strands are 

30 sequenced and a comparison between the peaks is carried out. In order to be registered as a 

polymorphic sequence, the polymorphism has to be detected on both strands. 

The above procedure permits those amplification products, which contain biallelic 
markers to be identified. The detection limit for the frequency of biallelic polymorphisms 
detected by sequencing pools of 1 00 individuals is approximately 0.1 for the minor allele, as 

35 verified by sequencing pools of known allelic frequencies. However, more than 90% of the 
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biallelic polymorphisms detected by the pooling method have a frequency for the minor allele 
higher than 0.25. Therefore, the biallelic markers selected by this method have a frequency of 
at least 0.1 for the minor allele and less than 0.9 for the major allele. Preferably at least 0.2 for 
the minor allele and less than 0.8 for the major allele, more preferably at least 0.3 for the minor 
5 allele and less than 0.7 for the major allele, thus a heterozygosity rate higher than 0. 1 8, 

preferably higher than 0.32, more preferably higher than 0.42. 

In another embodiment, biallelic markers are detected by sequencing individual DNA 
samples, the frequency of the minor allele of such a biallelic marker may be less than 0.1. 

10 Validation of the biallelic markers of the present invention 

The polymorphisms are evaluated for their usefulness as genetic markers by validating 
that both alleles are present in a population. Validation of the biallelic markers is accomplished 
by genotyping a group of individuals by a method of the invention and demonstrating that both 
alleles are present. M icrosequencing is a preferred method of genotyping alleles. The 

1 5 validation by genotyping step may be performed on individual samples derived from each 

individual in the group or by genotyping a pooled sample derived from more than one 
individual. The group can be as small as one individual if that individual is heterozygous for 
the allele in question. Preferably the group contains at least three individuals, more preferably 
the group contains five or six individuals, so that a single validation test will be more likely to 

20 result in the validation of more of the biallelic markers that are being tested. It should be noted, 

however, that when the validation test is performed on a small group it may result in a false 
negative result if as a result of sampling error none of the individuals tested carries one of the 
two alleles. Thus, the validation process is less useful in demonstrating that a particular initial 
result is an artifact, than it is at demonstrating that there is a bona fide biallelic marker at a 

25 particular position in a sequence. All of the genotyping, hapiotyping, association, and 

interaction study methods of the invention may optionally be performed solely with validated 
biallelic markers. 



Evaluation of the frequency of the biallelic markers of the present invention 

30 The validated biallelic markers are further evaluated for their usefulness as genetic 

markers by determining the frequency of the least common allele at the biallelic marker site. 
The determination of the least common allele is accomplished by genotyping a group of 
individuals by a method of the invention and demonstrating that both alleles are present. This 
determination of frequency by genotyping step may be performed on individual samples derived 

35 from each individual in the group or by genotyping a pooled sample derived from more than 
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one individual. The group must be large enough to be representative of the population as a 
whole. Preferably the group contains at least 20 individuals, more preferably the group contains 
at least 50 individuals, most preferably the group contains at least 100 individuals. Of course 
the larger the group the greater the accuracy of the frequency determination because of reduced 

5 sampling error. A biallelic marker wherein the frequency of the less common allele is 30% or 

more is termed a "high quality biallelic marker." AH of the genotyping, haplotyping, 
association, and interaction study methods of the invention may optionally be performed solely 
with high quality biallelic markers. 

Another embodiment of the invention comprises methods of estimating the frequency 

10 of an allele in a population comprising genotyping individuals from said population for a 

13q31-q33-related biallelic marker and determining the proportional representation of said 
biallelic marker in said population. In addition, the methods of estimating the frequency of an 
allele in a population encompass methods with any further limitation described in this 
disclosure, or those following, specified alone or in any combination: Optionally, said 13q31- 

15 q33-related biallelic marker may be in a sequence selected individually or in any combination 

from the group consisting of SEQ Nos 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32; and the 
complements thereof; optionally, said 13q31-q33-related biallelic marker may be selected from 
the biallelic markers described in Table 5; optionally, determining the frequency of a biallelic 
marker allele in a population may be accomplished by determining the identity of the 

20 nucleotides for both copies of said biallelic marker present in the genome of each individual in 

said population and calculating the proportional representation of said nucleotide at said 13q3 1- 
q33-related biallelic marker for the population; optionally, determining the frequency of a 
biallelic marker allele in a population may be accomplished by performing a genotyping method 
on a pooled biological sample derived from a representative number of individuals, or each 

25 individual, in said population, and calculating the proportional amount of said nucleotide 

compared with the total. 

Methods Of Genotyping An Individual For Biallelic Markers 
Methods are provided to genotype a biological sample for one or more biallelic markers 
of the present invention, all of which may be performed in vitro. Such methods of genotyping 

30 comprise determining the identity of a nucleotide at an biallelic marker of the invention by any 

method known in the art. These methods find use in genotyping case-control populations in 
association studies as well as individuals in the context of detection of alleles of biallelic 
markers which, are known to be associated with a given trait, in which case both copies of the 
biallelic marker present in individual's genome are determined so that an individual may be 

35 classified as homozygous or heterozygous for a particular allele. 
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These genotyping methods can be performed nucleic acid samples derived from a single 
individual or pooled DNA samples. 

Genotyping can be performed using similar methods as those described above for the 
identification of the biallelic markers, or using other genotyping methods such as those further 
5 described below. In preferred embodiments, the comparison of sequences of amplified genomic 

fragments from different individuals is used to identify new biallelic markers whereas 
microsequencing is used for genotyping known biallelic markers in diagnostic and association 
study applications. 

Another embodiment of the invention encompasses methods of genotyping a biological 
10 sample comprising determining the identity of a nucleotide at a 13q3 l-q33-related biallelic 

marker. Thus, the invention encompasses a method of genotyping an individual comprising: (a) 
obtaining a biological sample comprising a nucleic acid from said individual; (b) determining 
the identity of a polymorphic base at a biallelic marker from said nucleic acid; wherein said 
polymorphic base is selected from the group consisting of the first or second respective allele at 
15 the nucleotide position of the polymorphic base of SEQ ID No 1, 24, 25, 28, or 74 as indicated 

in table 5; wherein the identity of the polymorphic base determines the genotype of the 
individual at said position. 

In addition, the genotyping methods of the invention encompass methods with any 
further limitation described in this disclosure, or those following, specified alone or in any 
20 combination: Optionally, said 13q3 l-q33-related biallelic marker may be in a sequence 

selected individually or in any combination from the group consisting of SEQ ID Nos 1 to 17, 
24, 25, 28, 74 and 79 to 132, and the complements thereof; optionally, said 13q3 l-q33-related 
biallelic marker may be selected individually or in any combination from the biallelic markers 
described in Table 5; optionally, said method further comprises determining the identity of a 
25 second nucleotide at said biallelic marker, wherein said first nucleotide and second nucleotide 

are not base paired (by Watson & Crick base pairing) to one another; optionally, said biological 
sample is derived from a single individual or subject; optionally, said method is performed in 
vitro; optionally, said biallelic marker is determined for both copies of said biallelic marker 
present in said individual's genome; optionally, said biological sample is derived from multiple 
30 subjects or individuals; optionally, said method further comprises amplifying a portion of said 

sequence comprising the biallelic marker prior to said determining step; optionally, wherein 
said amplifying is performed by PGR, LCR, or replication of a recombinant vector comprising 
an origin of replication and said portion in a host cell; optionally, wherein said determining is 
performed by a hybridization assay, sequencing assay, microsequencing assay, or an enzyme- 
35 based mismatch detection assay, 
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Source of DNA for genotyping 

Any source of nucleic acids, in purified or non-purified form, can be utilized as the 
starting nucleic acid, provided it contains or is suspected of containing the specific nucleic acid 
sequence desired. DNA or RNA may be extracted from cells, tissues, body fluids and the like 

5 as described herein. While nucleic acids for use in the genotyping methods of the invention can 

be derived from any mammalian source, the test subjects and individuals from which nucleic 
acid samples are taken are generally understood to be human. 

Amplification Of DNA Fragments Comprising Biallelic Markers 
Methods and polynucleotides are provided to amplify a segment of nucleotides 

1 0 comprising one or more biallelic marker of the present invention. It will be appreciated that 

amplification of DM A fragments comprising biallelic markers may be used in various methods 
and for various purposes and is not restricted to genotyping. Nevertheless, many genotyping 
methods, although not all, require the previous amplification of the DNA region carrying the 
biallelic marker of interest. Such methods specifically increase the concentration or total 

1 5 number of sequences that span the biallelic marker or include that site and sequences located 

either distal or proximal to it. Diagnostic assays may also rely on amplification of DNA 
segments carrying a biallelic marker of the present invention. 

Amplification of DNA may be achieved by any method known in the art. The 
established PCR (polymerase chain reaction) method or by developments thereof or 

20 alternatives. Amplification methods which can be utilized herein include but are not limited to 

Ligase Chain Reaction (LCR) as described in EP A 320 308 and EP A 439 1 82, Gap LCR 
(Wolcott, M.J.), the so-called "NASBA" or "3SR" technique described in Guatelli J.C. et al. 
(1990)andin ComptonJ. (1991), Q-beta amplification as described in EPA 4544 610, strand 
displacement amplification as described in Walker et al. (1996) and EP A 684 315 and, target 

25 mediated amplification as described in PCX Publication WO 9322461 . 

LCR and Gap LCR are exponential amplification techniques, both depend on DNA 
ligase to join adjacent primers annealed to a DNA molecule. In Ligase Chain Reaction (LCR), 
probe pairs are used which include two primary (first and second) and two secondary (third and 
fourth) probes, all of which are employed in molar excess to target. The first probe hybridizes 

30 to a first segment of the target strand and the second probe hybridizes to a second segment of 

the target strand, the first and second segments being contiguous so that the primary probes abut 
one another in 5' phosphate-3'hydroxyl relationship, and so that a ligase can covalently fuse or 
ligate the two probes into a fused product. In addition, a third (secondary) probe can hybridize 
to a portion of the first probe and a fourth (secondary) probe can hybridize to a portion of the 

35 second probe in a similar abutting fashion. Of course, if the target is initially double stranded, 
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the secondary probes also will hybridize to the target complement in the first instance. Once the 
ligated strand of primary probes is separated from the target strand, it will hybridize with the 
third and fourth probes which can be ligated to form a complementary, secondary ligated 
product. It is important to realize that the ligated products are functionally equivalent to either 
5 the target or its complement. By repeated cycles of hybridization and ligation, amplification of 

the target sequence is achieved. A method for multiplex LCR has also been described (WO 
9320227). Gap LCR (GLCR) is a version of LCR where the probes are not adjacent but are 
separated by 2 to 3 bases. 

For amplification of mRNAs, it is within the scope of the present invention to reverse 
1 0 transcribe mRN A into cDN A followed by polymerase chain reaction (RT-PCR); or, to use a 

single enzyme for both steps as described in U.S. Patent No. 5,322,770 or, to use Asymmetric 
Gap LCR (RT-AGLCR) as described by Marshall R.L. et al. (1 994). AGLCR is a modification 
of GLCR that allows the amplification of RNA. 

Some of these amplification methods are particularly suited for the detection of single 
1 5 nucleotide polymorphisms and allow the simultaneous amplification of a target sequence and 

the identification of the polymorphic nucleotide as it is further described herein. 

The PCR technology is the preferred amplification technique used in the present 
invention, A variety of PCR techniques are familiar to those skilled in the art. For a review of 
PCR technology, see Molecular Cloning to Genetic Engineering White, B.A. Ed. (1997) and the 
20 publication entitled "PCR Methods and Applications" (1991, Cold Spring Harbor Laboratory 

Press). In each of these PCR procedures, PCR primers on either side of the nucleic acid 
sequences to be amplified are added to a suitably prepared nucleic acid sample along with 
dNTPs and a thermostable polymerase such as Taq polymerase, Pfu polymerase, or Vent 
polymerase. The nucleic acid in the sample is denatured and the PCR primers are specifically 
25 hybridized to complementary nucleic acid sequences in the sample. The hybridized primers are 

extended. Thereafter, another cycle of denaturation, hybridization, and extension is initiated. 
The cycles are repeated multiple times to produce an amplified fragment containing the nucleic 
acid sequence between the primer sites. PCR has further been described in several patents 
including US Patents 4,683,195, 4,683,202 and 4,965,188. 
30 Primers can be prepared by any suitable method. As for example, direct chemical 

synthesis by a method such as the phosphodiester method of Narang S.A. et al. (1979), the 
phosphodiester method of Brown E.L. et al. (1979), the diethylphosphoramidite method of 
Beaucage et al. (1981) and the solid support method described in EP 0 707 592. 

In some embodiments the present invention provides primers for amplifying a DNA 
35 fragment containing one or more biallelic markers of the present invention. It will be 
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appreciated that the primers listed are merely exemplary and that any other set of primers which 
produce amplification products containing one or more biallelic markers of the present 
invention. 

The spacing of the primers determines the length of the segment to be amplified. In the 
5 context of the present invention amplified segments carrying biallelic markers can range in size 

from at least about 25 bp to 35 kbp. Amplification fragments from 25-3000 bp are typical, 
fragments from 50-1000 bp are preferred and fragments from 100-600 bp are highly preferred. 
It will be appreciated that amplification primers for the biallel ic markers may be any sequence 
which allow the specific amplification of any DNA fragment carrying the markers. 
10 Amplification primers may be labeled or immobilized on a solid support as described in the 

section titled "Oligonucleotide Probes and Primers". 

Methods of Genotyping DNA samples for Biallelic Markers 

Any method known in the art can be used to identify the nucleotide present at a biallelic 
marker site. Since the biallelic marker allele to be detected has been identified and specified in 

15 the present invention, detection will prove simple for one of ordinary skill in the art by 

employing any of a number of techniques. Many genotyping methods require the previous 
amplification of the DNA region carrying the biallelic marker of interest. While the 
amplification of target or signal is often preferred at present, ultrasensitive detection methods 
which do not require amplification are also encompassed by the present genotyping methods. 

20 Methods well-known to those skilled in the art that can be used to detect biallelic 

polymorphisms include methods such as, conventional dot blot analyzes, single strand 
conformational polymorphism analysis (SSCP) described by Orita et al. (1989), denaturing 
gradient gel electrophoresis (DGGE), heteroduplex analysis, mismatch cleavage detection, and 
other conventional techniques as described in Sheffield, V.C, et al. (1991), White et al. (1992), 

25 Grompe, M. et al. (1989) and Grompe, M. (1993). Another method for determining the identity 

of the nucleotide present at a particular polymorphic site employs a specialized exonuclease- 
resistant nucleotide derivative as described in US patent 4,656,127. 

Preferred methods involve directly determining the identity of the nucleotide present at 
a biallelic marker site by sequencing assay, enzyme-based mismatch detection assay, or 

30 hybridization assay. The following is a description of some preferred methods. A highly 

preferred method is the microsequencing technique. The term "sequencing assay" is used 
herein to refer to polymerase extension of duplex primer/template complexes and includes both 
traditional sequencing and microsequencing. 
1) Sequencing assays 

35 The nucleotide present at a polymorphic site can be determined by sequencing methods. 
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In a preferred embodiment, DN A samples are subjected to PCR amplification before 
sequencing as described above, DNA sequencing methods are described in herein. Preferably, 
the amplified DNA is subjected to automated dideoxy terminator sequencing reactions using a 
dye-primer cycle sequencing protocol. Sequence analysis allows the identification of the base 
5 present at the biallelic marker site. 

2) Microsequencing assays 

In microsequencing methods, a nucleotide at the polymorphic site that is unique to one 
of the alleles in a target DNA is detected by a single nucleotide primer extension reaction. This 
method involves appropriate microsequencing primers which, hybridize just upstream of a 

10 polymorphic base of interest in the target nucleic acid. A polymerase is used to specifically 

extend the 3' end of the primer with one single ddNTP (chain terminator) complementary to the 
selected nucleotide at the polymorphic site. Next the identity of the incorporated nucleotide is 
determined in any suitable way. 

Typically, microsequencing reactions are carried out using fluorescent ddNTPs and the 

15 extended microsequencing primers are analyzed by electrophoresis on ABI 377 sequencing 

machines to determine the identity of the incorporated nucleotide as described in EP 412 883. 
Alternatively capillary electrophoresis can be used in order to process a higher number of assays 
simultaneously. An example of a typical microsequencing procedure that can be used in the 
context of the present invention is provided in example 4. 

20 Different approaches can be used to detect the nucleotide added to the microsequencing 

primer, A homogeneous phase detection method based on fluorescence resonance energy 
transfer has been described by Chen and Kwok (1997) and Chen et al. (1997). In this method 
amplified genomic DNA fragments containing polymorphic sites are 1 incubated with a 5- 
fluorescein-labeled primer in the presence of allelic dye-labeled dideoxy ribonucleoside 

25 triphosphates and a modified Taq polymerase. The dye-labeled primer is extended one base by 

the dye-terminator specific for the allele present on the template. At the end of the genotyping 
reaction, the fluorescence intensities of the two dyes in the reaction mixture are analyzed 
directly without separation or purification. All these steps can be performed in the same tube 
and the fluorescence changes can be monitored in real time. Alternatively, the extended primer 

30 may be analyzed by MALDI-TOF Mass Spectrometry. The base at the polymorphic site is 

identified by the mass added onto the microsequencing primer (see Haff L.A. and Smirnov LP., 
1997). 

Microsequencing may be achieved by the established microsequencing method or by 
developments or derivatives thereof. Alternative methods include several solid-phase 
35 microsequencing techniques. The basic microsequencing protocol is the same as described 
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previously, except that the method is conducted as a heterogenous phase assay, in which the 
primer or the target molecule is immobilized or captured onto a solid support. To simplify the 
primer separation and the terminal nucleotide addition analysis, oligonucleotides are attached to 
solid supports or are modified in such ways that permit affinity separation as well as polymerase 
5 extension. The 5' ends and internal nucleotides of synthetic oligonucleotides can be modified in 

a number of different ways to permit different affinity separation approaches, e.g., biotinylation. 
If a single affinity group is used on the oligonucleotides, the oligonucleotides can be separated 
from the incorporated terminator regent. This eliminates the need of physical or size separation. 
More than one oligonucleotide can be separated from the terminator reagent and analyzed 

10 simultaneously if more than one affinity group is used. This permits the analysis of several 

nucleic acid species or more nucleic acid sequence information per extension reaction. The 
affinity group need not be on the priming oligonucleotide but could alternatively be present on 
the template. For example, immobilization can be carried out via an interaction between 
biotinylated DNA and streptavidin-coated microtitration wells or avidin-coated polystyrene 

15 particles. In the same manner oligonucleotides or templates may be attached to a solid support 

in a high-density format. In such solid phase microsequericing reactions, incorporated ddNTPs 
can be radiolabeled (Syvanen, 1994) or linked to fluorescein (Livak and Hainer, 1994). The 
detection of radiolabeled ddNTPs can be achieved through scintillation-based techniques. The 
detection of fluorescein-linked ddNTPs can be based on the binding of antifluorescein antibody 

20 conjugated with alkaline phosphatase, followed by incubation with a chromogenic substrate 

(such as /7-nitrophenyl phosphate). Other possible reporter-detection pairs include: ddNTP 
linked to dinitrophenyl (DNP) and anti-DNP alkaline phosphatase conjugate (Harju et al., 1993) 
or biotinylated ddNTP and horseradish peroxidase-conjugated streptavidin with o 
phenylenediamine as a substrate (WO 92/1 5712). As yet another alternative solid-phase 

25 microsequencing procedure, Nyren et al. (1993) described a method relying on the detection of 

DNA polymerase activity by an enzymatic luminometric inorganic pyrophosphate detection 
assay (ELIDA). 

Pastinen et al. (1997), describe a method for multiplex detection of single nucleotide 
polymorphism in which the solid phase minisequencing principle is applied to an 
30 oligonucleotide array format. High-density arrays of DNA probes attached to a solid support 

(DNA chips) are further described in herein. 

In one aspect the present invention provides polynucleotides and methods to genotype 
one or more biallelic markers of the present invention by performing a microsequencing assay. 
Preferred microsequencing primers include those being featured Table 6. It will be appreciated 
35 that the microsequencing primers listed in Table 6 are merely exemplary and that, any primer 
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having a 3' end immediately adjacent to a polymorphic nucleotide may be used. Similarly, it 
will be appreciated that microsequencing analysts may be performed for any biallelic marker or 
any combination of biallelic markers of the present invention. One aspect of the present 
invention is a solid support which includes one or more microsequencing primers listed in Table 
5 6, or fragments comprising at least 8, at least 1 2, at least 15, or at least 20 consecutive 

nucleotides thereof and having a 3' terminus immediately upstream of the corresponding 
biallelic marker, for determining the identity of a nucleotide at biallelic marker site. 
3) Mismatch detection assays based on polymerases and ligases 
In one aspect the present invention provides polynucleotides and methods to determine 
10 the allele of one or more biallelic markers of the present invention in a biological sample, by 

mismatch detection assays based on polymerases and/or ligases. These assays are based on the 
specificity of polymerases and ligases. Polymerization reactions places particularly stringent 
requirements on correct base pairing of the 3' end of the amplification primer and the joining of 
two oligonucleotides hybridized to a target DNA sequence is quite sensitive to mismatches 
1 5 close to the ligation site, especially at the 3' end. The terms "enzyme based mismatch detection 

assay" are used herein to refer to any method of determining the allele of a biallelic marker 
based on the specificity of ligases and polymerases. Preferred methods are described below. 
Methods, primers and various parameters to amplify DNA fragments comprising biallelic 
markers of the present invention are further described herein. 
20 Allele specific amplification 

Discrimination between the two alleles of a biallelic marker can also be achieved by 
allele specific amplification, a selective strategy, whereby one of the alleles is amplified without 
amplification of the other allele. This is accomplished by placing a polymorphic base at the 3' 
end of one of the amplification primers. Because the extension forms from the 3'end of the 
25 primer, a mismatch at or near this position has an inhibitory effect on amplification. Therefore, 

under appropriate amplification conditions, these primers only direct amplification on their 
complementary allele. Designing the appropriate allele-specific primer and the corresponding 
assay conditions are well with the ordinary skill in the art. 
Ligation/amplification based methods 
30 The "Oligonucleotide Ligation Assay" (OLA) uses two oligonucleotides which are 

designed to be capable of hybridizing to abutting sequences of a single strand of a target 
molecules. One of the oligonucleotides is biotinylated, and the other is detectably labeled. If 
the precise complementary sequence is found in a target molecule, the oligonucleotides will 
hybridize such that their termini abut, and create a ligation substrate that can be captured and 
35 detected. OLA is capable of detecting biallelic markers and may be advantageously combined 



WO 01/40493 PCT/EB00/Q1507 

78 

with PGR as described by Nickerson D.A. etal.(1990). In this method, PGR is used to achieve 
the exponential amplification of target DNA, which is then detected using OLA. 

Other methods which are particularly suited for the detection of bialielic markers 
include LCR (ligase chain reaction), Gap LCR (GLCR) which are described herein. As 
5 mentioned above LCR uses two pairs of probes to exponentially amplify a specific target. The 

sequences of each pair of oligonucleotides, is selected to permit the pair to hybridize to abutting 
sequences of the same strand of the target. Such hybridization forms a substrate for a template- 
dependant ligase. In accordance with the present invention, LCR can be performed with 
oligonucleotides having the proximal and distal sequences of the same strand of a bialielic 

1 0 marker site. In one embodiment, either oligonucleotide will be designed to include the bialielic 

marker site. In such an embodiment, the reaction conditions are selected such that the 
oligonucleotides can be ligated together only if the target molecule either contains or lacks the 
specific nucleotide(s) that is complementary to the bialielic marker on the oligonucleotide. In 
an alternative embodiment, the oligonucleotides will not include the bialielic marker, such that 

15 when they hybridize to the target molecule, a "gap" is created as described in WO 90/01069. 

his gap is then "filled" with complementary dNTPs (as mediated by DNA polymerase), or by an 
additional pair of oligonucleotides. Thus at the end of each cycle, each single strand has a 
complement capable of serving as a target during the next cycle and exponential allele-specific 
amplification of the desired sequence is obtained. 

20 Ligase/Polymerase-mediated Genetic Bit Analysis™ is another method for determining 

the identity of a nucleotide at a preselected site in a nucleic acid molecule (WO 95/2 1271). This 
method involves the incorporation of a nucleoside triphosphate that is complementary to the 
nucleotide present at the preselected site onto the terminus of a primer molecule, and their 
subsequent ligation to a second oligonucleotide. The reaction is monitored by detecting a 

25 specific label attached to the reaction's solid phase or by detection in solution. 

4) Hybridization assay methods 

A preferred method of determining the identity of the nucleotide present at a bialielic 
marker site involves nucleic acid hybridization. The hybridization probes, which can be 
conveniently used in such reactions, preferably include the probes defined herein. Any 
30 hybridization assay may be used including Southern hybridization, Northern hybridization, dot 

blot hybridization and solid-phase hybridization (see Sambrook et aL, Molecular Cloning - A 
Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., 1989). 

Hybridization refers to the formation of a duplex structure by two single stranded 
nucleic acids due to complementary base pairing. Hybridization can occur between exactly 
35 complementary nucleic acid strands or between nucleic acid strands that contain minor regions 
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of mismatch. Specific probes can be designed that hybridize to one form of a biallelic marker 
and not to the other and therefore are able to discriminate between different allelic forms. 
Allele-specific probes are often used in pairs, one member of a pair showing perfect match to a 
target sequence containing the original allele and the other showing a perfect match to the target 
5 sequence containing the alternative allele. Hybridization conditions should be sufficiently 

stringent that there is a significant difference in hybridization intensity between alleles, and 
preferably an essentially binary response, whereby a probe hybridizes to only one of the alleles. 
Stringent, sequence specific hybridization conditions, under which a probe will hybridize only 
to the exactly complementary target sequence are well known in the art (Sambrook et ah, 

10 Molecular Cloning - A Laboratory Manual, Second Edition, Cold Spring Harbor Press, N.Y., 

1989). Stringent conditions are sequence dependent and will be different in different 
circumstances. Generally, stringent conditions are selected to be about 5°C lower than the 
thermal melting point (Tm) for the specific sequence at a defined ionic strength and pH. By 
way of example and not limitation, procedures using conditions of high stringency are as 

15 follows: Prehybridization of filters containing DNA is carried out for 8 h to overnight at 65°C in 

buffer composed of 6X SSC, 50 mM Tris-HCl (pH 7.5), 1 mM EDTA, 0.02% PVP, 0.02% 
Ficoll, 0.02% BSA, and 500 jig/ml denatured salmon sperm DNA. Filters are hybridized for 
48 h at 65°C, the preferred hybridization temperature, in prehybridization mixture containing 
100 ug/ml denatured salmon sperm DNA and 5-20 X 10 6 cpm of 32 P-Iabeled probe. 

20 Alternatively, the hybridization step can be performed at 65°C in the presence of SSC buffer, 1 

x SSC corresponding to 0,15MNaCl and 0.05 M Na citrate. Subsequently, filter washes can be 
done at 37°C for 1 h in a solution containing 2X SSC, 0.01% PVP, 0.01% Ficoll, and 0.0 1% 
BSA, followed by a wash in 0. 1 X SSC at 50°C for 45 min. Alternatively, filter washes can be 
performed in a solution containing 2 x SSC and 0.1% SDS, or 0.5 x SSC and 0.1% SDS, or 0. 1 

25 x SSC and 0.1% SDS at 68°C for 15 minute intervals. Following the wash steps, the hybridized 

probes are detectable by autoradiography. By way of example and not limitation, procedures 
using conditions of intermediate stringency are as follows: Filters containing DNA are 
prehybridized, and then hybridized at a temperature of 60°C in the presence of a 5 x SSC buffer 
and labeled probe. Subsequently, filters washes are performed in a solution containing 2x SSC 

30 at 50°C and the hybridized probes are detectable by autoradiography. Other conditions of high 

and intermediate stringency which may be used are well known in the art and as cited in 
Sambrook et al. (Molecular Cloning - A Laboratory Manual, Second Edition, Cold Spring 
Harbor Press, N.Y., 1989) and Ausubel et al. (Current Protocols in Molecular Biology, Green 
Publishing Associates and Wiley Interscience, N.Y., 1 989). 

35 Although such hybridizations can be performed in solution, it is preferred to employ a 
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solid-phase hybridization assay. The target DNA comprising a biallelic marker of the present 
invention may be amplified prior to the hybridization reaction. The presence of a specific allele 
in the sample is determined by detecting the presence or the absence of stable hybrid duplexes 
formed between the probe and the target DNA. The detection of hybrid duplexes can be carried 
5 out by a number of methods. Various detection assay formats are well known which utilize 

detectable labels bound to either the target or the probe to enable detection of the hybrid 
duplexes. Typically, hybridization duplexes are separated from unhybridized nucleic acids and 
the labels bound to the duplexes are then detected. Those skilled in the art will recognize that 
wash steps may be employed to wash away excess target DNA or probe. Standard 
1 0 heterogeneous assay formats are suitable for detecting the hybrids using the labels present on 

the primers and probes. 

Two recently developed assays allow hybridization-based allele discrimination with no 
need for separations or washes (see Landegren U. et al.,1998). The TaqMan assay takes 
advantage of the 5 f nuclease activity of Taq DNA polymerase to digest a DNA probe annealed 
15 specifically to the accumulating amplification product. TaqMan probes are labeled with a 

donor-acceptor dye pair that interacts via fluorescence energy transfer. Cleavage of the 
TaqMan probe by the advancing polymerase during amplification dissociates the donor dye 
from the quenching acceptor dye, greatly increasing the donor fluorescence. All reagents 
necessary to detect two allelic variants can be assembled at the beginning of the reaction and the 
20 results are monitored in real time (see Livak et al, 1995). In an alternative homogeneous 

hybridization-based procedure, molecular beacons are used for allele discriminations. 
Molecular beacons are hairpin-shaped oligonucleotide probes that report the presence of 
specific nucleic acids in homogeneous solutions. When they bind to their targets they undergo a 
conformational reorganization that restores the fluorescence of an internally quenched 
25 fluorophore (Tyagi et al., 1998). 

By assaying the hybridization to an allele specific probe, one can detect the presence or 
absence of a biallelic marker allele in a given sample. 

High-Throughput parallel hybridizations in array format are specifically encompassed 
within "hybridization assays" and are described below. 
30 Hybridization to addressable arrays of oligonucleotides 

Hybridization assays based on oligonucleotide arrays rely on the differences in 
hybridization stability of short oligonucleotides to perfectly matched and mismatched target 
sequence variants. Efficient access to polymorphism information is obtained through a basic 
structure comprising high-density arrays of oligonucleotide probes attached to a solid support 
35 (the chip) at selected positions. Each DNA chip can contain thousands to millions of individual 
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synthetic DNA probes arranged in a grid-like pattern and miniaturized to the size of a dime. 

The chip technology has already been applied with success in numerous cases. For 
example, the screening of mutations has been undertaken in the BRCA1 gene, in S. cerevisiae 
mutant strains, and in the protease gene of HIV-1 virus (Hacia et al., 1996; Shoemaker et al., 
5 1996 ; Kozal et al., 1996). Chips of various formats for use in detecting biallelic 

polymorphisms can be produced on a customized basis by Affymetrix (GeneChip™), Hyseq 
(HyChip and HyGnostics), and Protogene Laboratories. 

In general, these methods employ arrays of oligonucleotide probes that are 
complementary to target nucleic acid sequence segments from an individual which, target 

1 0 sequences include a polymorphic marker. EP785280, describes a tiling strategy for the 

detection of single nucleotide polymorphisms. Briefly, arrays may generally be "tiled" for a 
large number of specific polymorphisms. By filing" is generally meant the synthesis of a 
defined set of oligonucleotide probes which is made up of a sequence complementary to the 
target sequence of interest, as well as preselected variations of that sequence, e.g., substitution 

1 5 of one or more given positions with one or more members of the basis set of monomers, i.e. 

nucleotides. Tiling strategies are further described in PCT application No. WO 95/1 1995. In a 
particular aspect, arrays are tiled for a number of specific, identified biallelic marker sequences. 
In particular the array is tiled to include a number of detection blocks, each detection block 
being specific for a specific biallelic marker or a set of biallelic markers. For example, a 

20 detection block may be tiled to include a number of probes, which span the sequence segment 

that includes a specific polymorphism. To ensure probes that are complementary to each allele, 
the probes are synthesized in pairs differing at the biallelic marker. In addition to the probes 
differing at the polymorphic base, monosubstituted probes are also generally tiled within the 
detection block. These monosubstituted probes have bases at and up to a certain number of 

25 bases in either direction from the polymorphism, substituted with the remaining nucleotides 

(selected from A, T, G, C and U). Typically the probes in a tiled detection block will include 
substitutions of the sequence positions up to and including those that are 5 bases away from the 
biallelic marker. The monosubstituted probes provide internal controls for the tiled array, to 
distinguish actual hybridization from artefactual cross-hybridization. Upon completion of 

30 hybridization with the target sequence and washing of the array, the array is scanned to 

determine the position on the array to which the target sequence hybridizes. The hybridization 
data from the scanned array is then analyzed to identify which allele or alleles of the biallelic 
marker are present in the sample. Hybridization and scanning may be carried out as described 
in PCT application No. WO 92/10092 and WO 95/1 1995 and US patent No. 5,424,186. 

35 Thus, in some embodiments, the chips may comprise an array of nucleic acid sequences 
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of fragments of about 15 nucleotides in length. In further embodiments, the chip may comprise 
an array including at least one of the sequences selected from the group consisting of SEQ ID 
Nos. 1 to 1 7, 24, 25,28, 74 and 79 to 1 32 and the sequences complementary thereto, or a 
fragment thereof at least about 8 consecutive nucleotides, preferably 10, 15, 20, more preferably 
5 25, 30, 40, 47, or 50 consecutive nucleotides. In some embodiments, the chip may comprise an 

array of at least 2, 3, 4, 5, 6, 7, 8 or more of these polynucleotides of the invention. Solid 
supports and polynucleotides of the present invention attached to solid supports are further 
described in the section titled "Oligonucleotide probes and Primers". 
5) Integrated Systems 

1 0 Another technique, which may be used to analyze polymorphisms, includes 

multi component integrated systems, which miniaturize and compartmentalize processes such as 
PCR and capillary electrophoresis reactions in a single functional device. An example of such 
technique is disclosed in US patent 5,589,136, which describes the integration of PCR 
amplification and capillary electrophoresis in chips. 

15 Integrated systems can be envisaged mainly when microfluidic systems are used. These 

systems comprise a pattern of tnicrochannels designed onto a glass, silicon, quartz, or plastic 
wafer included on a microchip. The movements of the samples are controlled by electric, 
electroosmotic or hydrostatic forces applied across different areas of the microchip. For 
genotyping biallelic markers, the microfluidic system may integrate nucleic acid amplification, 

20 microsequencing, capillary electrophoresis and a detection method such as laser-induced 

fluorescence detection. 



Methods Of Genetic Analysis Using The Biallelic Markers Of The Present 
Invention 

25 Different methods are available for the genetic analysis of complex traits (see Lander 

and Schork, 1994). The search for disease-susceptibility genes, is conducted using two main 
methods: the linkage approach in which evidence is sought for cosegregation between a locus 
and a putative trait locus using family studies, and the association approach in which evidence is 
sought for a statistically significant association between an allele and a trait or a trait causing 

30 allele (Khoury J. et al, 1993). In general, the biallelic markers of the present invention find use 

in any method known in the art to demonstrate a statistically significant correlation between a 
genotype and a phenotype. The biallelic markers may be used in parametric and non-parametric 
linkage analysis methods. Preferably, the biallelic markers of the present invention are used to 
identify genes associated with detectable traits using association studies, an approach which 

35 does not require the use of affected families and which permits the identification of genes 
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associated with complex and sporadic traits. 

The genetic analysis using the biallelic markers of the present invention may be 
conducted on any scale. The whole set of biallelic markers of the present invention or any 
subset of biallelic markers of the present invention may be used. In some embodiments a subset 
5 of biallelic markers corresponding to one or several candidate genes of the present invention 

may be used. Alternatively, a subset of biallelic markers of the present invention localised on a 
specific chromosome segment may be used. Further, any set of genetic markers including a 
biallelic marker of the present invention may be used. As mentioned above, it should be noted 
that the biallelic markers of the present invention may be included in any complete or partial 
10 genetic map of the human genome. These different uses are specifically contemplated in the 

present invention and claims. 



Linkage analysis 

Linkage analysis is based upon establishing a correlation between the transmission of 
15 genetic markers and that of a specific trait throughout generations within a family. Thus, the 

aim of linkage analysis is to detect marker loci that show cosegregation with a trait of interest in 
pedigrees. 

Parametric methods 

When data are available from successive generations there is the opportunity to study 

20 the degree of linkage between pairs of loci. Estimates of the recombination fraction enable loci 

to be ordered and placed onto a genetic map. With loci that are genetic markers, a genetic map 
can be established, and then the strength of linkage between markers and traits can be calculated 
and used to indicate the relative positions of markers and genes affecting those traits (Weir, 
B.S., 1996). The classical method for linkage analysis is the logarithm of odds (lod) score 

25 method (see Morton N.E., 1955; Ott J, 1991). Calculation of lod scores requires specification of 

the mode of inheritance for the disease (parametric method). Generally, the length of the 
candidate region identified using linkage analysis is between 2 and 20Mb. Once a candidate 
region is identified as described above, analysis of recombinant individuals using additional 
markers allows further delineation of the candidate region. Linkage analysis studies have 

30 generally relied on the use of a maximum of 5,000 microsatellite markers, thus limiting the 

maximum theoretical attainable resolution of linkage analysis to about 600 kb on average. 

Linkage analysis has been successfully applied to map simple genetic traits that show 
clear Mendelian inheritance patterns and which have a high penetrance (i.e., the ratio between 
the number of trait positive carriers of allele a and the total number of a carriers in the 

35 population). However, parametric linkage analysis suffers from a variety of drawbacks. First, it 
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is limited by its reliance on the choice of a genetic model suitable for each studied trait 
Furthermore, as already mentioned, the resolution attainable using linkage analysis is limited, 
and complementary studies are required to refine the analysis of the typical 2Mb to 20Mb 
regions initially identified through linkage analysis. In addition, parametric linkage analysis 

5 approaches have proven difficult when applied to complex genetic traits, such as those due to 

the combined action of multiple genes an d/or environmental factors. It is very difficult to 
model these factors adequately in a lod score analysis. In such cases, too large an effort and 
cost are needed to recruit the adequate number of affected families required for applying linkage 
analysis to these situations, as recently discussed by Risch, N. and Merikangas, K. (1996). 

10 Non-parametric methods 

The advantage of the so-called non-parametric methods for linkage analysis is that they 
do not require specification of the mode of inheritance for the disease, they tend to be more 
useful for the analysis of complex traits. In non-parametric methods, one tries to prove that the 
inheritance pattern of a chromosomal region is not consistent with random Mendel ian 

15 segregation by showing that affected relatives inherit identical copies of the region more often 

than expected by chance. Affected relatives should show excess "allele sharing" even in the 
presence of incomplete penetrance and polygenic inheritance. In non-parametric linkage 
analysis the degree of agreement at a marker locus in two individuals can be measured either by 
the number of alleles identical by state (IBS) or by the number of alleles identical by descent 

20 (IBD). Affected sib pair analysis is a well-known special case and is the simplest form of these 

methods. 

The bialleiic markers of the present invention may be used in both parametric and non- 
parametric linkage analysis. Preferably bialleiic markers may be used in non-parametric 
methods which allow the mapping of genes involved in complex traits. The bialleiic markers of 
25 the present invention may be used in both IBD- and IBS- methods to map genes affecting a 

complex trait. In such studies, taking advantage of the high density of bialleiic markers, several 
adjacent bialleiic marker loci may be pooled to achieve the efficiency attained by muiti-allelic 
markers (Zhao et al. f 1998). 

However, both parametric and non-parametric linkage analysis methods analyse 
30 affected relatives, they tend to be of limited value in the genetic analysis of drug responses or in 

the analysis of side effects to treatments. This type of analysis is impractical in such cases due 
to the lack of availability of familial cases. In fact, the likelihood of having more than one 
individual in a family being exposed to the same drug at the same time is extremely low. 
Population Association Studies 
35 the present invention comprises methods for identifying one or several genes among a 
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set of candidate genes that are associated with a detectable trait using the biallelic markers of 
the present invention. In one embodiment the present invention comprises methods to detect an 
association between a biallelic marker allele or a biallelic marker haplotype and a trait. Further, 
the invention comprises methods to identify a trait causing allele in linkage disequilibrium with 
5 any biallelic marker allele of the present invention. 

As described above, alternative approaches can be employed to perform association 
studies: genome-wide association studies, candidate region association studies and candidate 
gene association studies. The candidate region analysis clearly provides a short-cut approach to 
the identification of genes and gene polymorphisms related to a particular trait when some 
10 information concerning the biology of the trait is available. Further, the biallelic markers of the 

present invention may be incorporated in any map of genetic markers of the human genome in 
order to perform genome- wide association studies. Methods to generate a high-density map of 
biallelic markers has been described in US Provisional Patent application serial number 
60/082,614. The biallelic markers of the present invention may further be incorporated in any 
1 5 map of a specific candidate region of the genome (a specific chromosome or a specific 

chromosomal segment for example). 

As mentioned above, association studies may be conducted within the general 
population and are not limited to studies performed on related individuals in affected families. 
Association studies are extremely valuable as they permit the analysis of sporadic or multifactor 
20 traits, Moreover, association studies represent a powerful method for fine-scale mapping 

enabling much finer mapping of trait causing alleles than linkage studies. Studies based on 
pedigrees often only narrow the location of the trait causing allele. Association studies using 
the biallelic markers of the present invention can therefore be used to refine the location of a 
trait causing allele in a candidate region identified by Linkage Analysis methods. Biallelic 
25 markers of the present invention can be used to identify the involved gene; such uses are 

specifically contemplated in the present invention and claims. 

1) Determining the frequency of a biallelic marker allele or of a biallelic marker 
haplotype in a population 

Another embodiment of the present invention encompasses methods of estimating the 
30 frequency of a haplotype for a set of biallelic markers in a population, comprising the steps of: 

a) genotyping each individual in said population for at least one 13q31-q33-related biallelic 
marker, b) genotyping each individual in said population for a second biallelic marker by 
determining the identity of the nucleotides at said second biallelic marker for both copies of said 
second biallelic marker present in the genome; and c) applying a haplotype determination 
35 method to the identities of the nucleotides determined in steps a) and b) to obtain an estimate of 
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said frequency. In addition, the methods of estimating the frequency of a haplotype of the 
invention encompass methods with any further limitation described in this disclosure, or those 
following, specified alone or in any combination: optionally said haplotype determination 
method is selected from the group consisting of asymmetric PCR amplification, double PCR 
5 amplification of specific alleles, the Clark method, or an expectation maximization algorithm; 

optionally, said second biallelic marker is a 13q31-q33-related biallelic marker in a sequence 
selected from the group consisting of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132, and the 
complements thereof; optionally, said 13q31-q33-related biallelic marker may be selected 
individually or in any combination from the biallelic markers described in Tables 6b and 6c; 
10 optionally, the identity of the nucleotides at the biallelic markers in everyone of the sequences 

of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132 is determined in stepsa) and b). 

Association studies explore the relationships among frequencies for sets of alleles 
between loci. 

Determining the frequency of an allele in a population 

15 Allelic frequencies of the biallelic markers in a population can be determined using one 

of the methods described above under the heading "Methods for genotyping an individual for 
biallelic markers", or any genotyping procedure suitable for this intended purpose. Genotyping 
pooled samples or individual samples can determine the frequency of a biallelic marker allele in 
a population. One way to reduce the number of genotypings required is to use pooled samples. 

20 A major obstacle in using pooled samples is in terms of accuracy and reproducibility for 

determining accurate DNA concentrations in setting up the pools. Genotyping individual 
samples provides higher sensitivity, reproducibility and accuracy and; is the preferred method 
used in the present invention. Preferably, each individual is genotyped separately and simple 
gene counting is applied to determine the frequency of an allele of a biallelic marker or of a 

25 genotype in a given population. 

Determining the frequency of a haplotype in a population 
The gametic phase of haplotypes is unknown when diploid individuals are 
heterozygous at more than one locus. Using genealogical information in families gametic phase 
can sometimes be inferred (Perlin et al., 1994). When no genealogical information is available 

30 different strategies may be used. One possibility is that the multiple-site heterozygous diploids 

can be eliminated from the analysis, keeping only the homozygotes and the single-site 
heterozygote individuals, but this approach might lead to a possible bias in the sample 
composition and the underestimation of low-frequency haplotypes. Another possibility is that 
single chromosomes can be studied independently, for example, by asymmetric PCR 

35 amplification (see Newton et al., 1 989; Wu et aL, 1 989) or by isolation of single chromosome 
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by limit dilution followed by PCR amplification (see Ruano et al., 1990). Further, a sample may 
be haplotyped for sufficiently close biallelic markers by double PCR amplification of specific 
alleles (Sarkar, G. and Sommer S.S., 1991). These approaches are not entirely satisfying either 
because of their technical complexity, the additional cost they entail, their lack of generalisation 
5 at a large scale, or the possible biases they introduce. To overcome these difficulties, an 

algorithm to infer the phase of PCR-amplified DNA genotypes introduced by Clark A.G. (1990) 
may be used. Briefly, the principle is to start filling a preliminary list of haplotypes present in 
the sample by examining unambiguous individuals, that is, the complete homozygotes and the 
single-site heterozygotes. Then other individuals in the same sample are screened for the 

10 possible occurrence of previously recognised haplotypes. For each positive identification, the 

complementary haplotype is added to the list of recognised haplotypes, until the phase 
information for all individuals is either resolved or identified as unresolved. This method 
assigns a single haplotype to each multiheterozygous individual, whereas several haplotypes are 
possible when there are more than one heterozygous site. Alternatively, one can use methods 

1 5 estimating haplotype frequencies in a population without assigning haplotypes to each 

individual. Preferably, a method based on an expectation-maximization (EM) algorithm 
(Dempster et al., J, R. 1977) leading to maximum-likelihood estimates of haplotype frequencies 
under the assumption of Hardy-Weinberg proportions (random mating) is used (see Excoffier L. 
and Slatkin M., 1995). The EM algorithm is a generalised iterative maximum- likelihood 

20 approach to estimation that is useful when data are ambiguous and/or incomplete. The EM 

algorithm is used to resolve heterozygotes into haplotypes. Haplotype estimations are further 
described below under the heading "Statistical methods». Any other method known in the art to 
determine or to estimate the frequency of a haplotype in a population may also be used. 
2) Linkage Disequilibrium analysis 

25 Linkage disequilibrium is the non-random association of alleles at two or more loci and 

represents a powerful tool for mapping genes involved in disease traits (see Ajioka R.S. et al., 
1997). Biallelic markers, because they are densely spaced in the human genome and can be 
genotyped in more numerous numbers than other types of genetic markers (such as RFLP or 
VNTR markers), are particularly useful in genetic analysis based on linkage disequilibrium. 

30 The biallelic markers of the present invention may be used in any linkage disequilibrium 

analysis method known in the art. 

Briefly, when a disease mutation is first introduced into a population (by a new 
mutation or the immigration of a mutation carrier), it necessarily resides on a single 
chromosome and thus on a single "background" or "ancestral" haplotype of linked markers. 

35 Consequently, there is complete disequilibrium between these markers and the disease 
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mutation: one finds the disease mutation only in the presence of a specific set of marker alleles. 
Through subsequent generations recombinations occur between the disease mutation and these 
marker polymorphisms, and the disequilibrium gradually dissipates. The pace of this 
dissipation is a function of the recombination frequency, so the markers closest to the disease 
5 gene will manifest higher levels of disequilibrium than those that are further away. When not 

broken up by recombination, "ancestral" haplotypes and linkage disequilibrium between marker 
alleles at different loci can be tracked not only through pedigrees but also through populations. 
Linkage disequilibrium is usually seen as an association between one specific allele at one locus 
and another specific allele at a second locus. 

10 The pattern or curve of disequilibrium between disease and marker loci is expected to 

exhibit a maximum that occurs at the disease locus. Consequently, the amount of linkage 
disequilibrium between a disease allele and closely linked genetic markers may yield valuable 
information regarding the location of the disease gene. For fine-scale mapping of a disease 
locus, it is useful to have some knowledge of the patterns of linkage disequilibrium that exist 

15 between markers in the studied region. As mentioned above the mapping resolution achieved 

through the analysis of linkage disequilibrium is much higher than that of linkage studies. The 
high density of biallelic markers combined with linkage disequilibrium analysis provides 
powerful tools for fine-scale mapping. Different methods to calculate linkage disequilibrium 
are described below under the heading "Statistical Methods". 

20 3) Population-based case-control studies of trait-marker associations 

As mentioned above, the occurrence of pairs of specific alleles at different loci on the 
same chromosome is not random and the deviation from random is called linkage 
disequilibrium. Association studies focus on population frequencies and rely on the 
phenomenon of linkage disequilibrium. If a specific allele in a given gene is directly involved 

25 in causing a particular trait, its frequency will be statistically increased in an affected (trait 

positive) population, when compared to the frequency in a trait negative population or in a 
random control population. As a consequence of the existence of linkage disequilibrium, the 
frequency of all other alleles present in the haplotype carrying the trait-causing allele will also 
be increased in trait positive individuals compared to trait negative individuals or random 

30 controls. Therefore, association between the trait and any allele (specifically a biallelic marker 

allele) in linkage disequilibrium with the trait-causing allele will suffice to suggest the presence 
of a trait-related gene in that particular region. Case-control populations can be genotyped for 
biallelic markers to identify associations that narrowly locate a trait causing allele. As any 
marker in linkage disequilibrium with one given marker associated with a trait will be 

35 associated with the trait. Linkage disequilibrium allows the relative frequencies in case-control 
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populations of a limited number of genetic polymorphisms (specifically biallelic markers) to be 
analysed as an alternative to screening all possible functional polymorphisms in order to find 
trait-causing alleles. Association studies compare the frequency of marker alleles in unrelated 
case-control populations, and represent powerful tools for the dissection of complex traits. 
5 Case-control populations (inclusion criteria) 

Population-based association studies do not concern familial inheritance but compare 
the prevalence of a particular genetic marker, or a set of markers, in case-control populations. 
They are case-control studies based on comparison of unrelated case (affected or trait positive) 
individuals and unrelated control (unaffected or trait negative or random) individuals. 

10 Preferably the control group is composed of unaffected or trait negative individuals. Further, 

the control group is ethnically matched to the case population. Moreover, the control group is 
preferably matched to the case-population for the main known confusion factor for the trait 
under study (for example age-matched for an age-dependent trait). Ideally, individuals in the 
two samples are paired in such a way that they are expected to differ only in their disease status. 

15 In the following "trait positive populations "case population" and "affected population*' are 

used interchangeably. 

An important step in the dissection of complex traits using association studies is the 
choice of case-control populations (see Lander and Schork, 1994). A major step in the choice 
of case-control populations is the clinical definition of a given trait or phenotype. Any genetic 

20 trait may be analysed by the association method proposed here by carefully selecting the 

individuals to be included in the trait positive and trait negative phenotypic groups. Four 
criteria are often useful: clinical phenotype, age at onset, family history and severity. The 
selection procedure for continuous or quantitative traits (such as blood pressure for example) 
involves selecting individuals at opposite ends of the phenotype distribution of the trait under 

25 study, so as to include in these trait positive and trait negative populations individuals with non- 

overlapping phenotypes. Preferably, case-control populations comprise phenotypically 
homogeneous populations. Trait positive and trait negative populations comprise 
phenotypically uniform populations of individuals representing each between 1 and 98%, 
preferably between 1 and 80%, more preferably between 1 and 50%, and more preferably 

30 between 1 and 30%, most preferably between 1 and 20% of the total population under study, 

and selected among individuals exhibiting non-overlapping phenotypes. The clearer the 
difference between the two trait phenotypes, the greater the probability of detecting an 
association with biallelic markers. The selection of those drastically different but relatively 
uniform phenotypes enables efficient comparisons in association studies and the possible 

35 detection of marked differences at the genetic level, provided that the sample sizes of the 
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populations under study are significant enough. 

In preferred embodiments, a first group of between 50 and 300 trait positive 
indi viduals^ preferably about 1 00 individuals, are recruited according to their phenotypes. A 
similar number of trait negative individuals are included in such studies. 
5 In the present invention, typical examples of inclusion criteria include affection by 

schizophrenia. 

Association analysis 

The general strategy to perform association studies using biallelic markers derived from 
a region carrying a candidate gene is to scan two groups of individuals (case-control 

10 populations) in order to measure and statistically compare the allele frequencies of the biallelic 

markers of the present invention in both groups. 

If a statistically significant association with a trait is identified for at least one or more 
of the analysed biallelic markers, one can assume that: either the associated allele is directly 
responsible for causing the trait (the associated allele is the trait causing allele), or more likely 

15 the associated allele is in linkage disequilibrium with the trait causing allele. The specific 

characteristics of the associated allele with respect to the gene function usually gives further 
insight into the relationship between the associated allele and the trait (causal or in linkage 
disequilibrium). If the evidence indicates that the associated allele within the gene is most 
probably not the trait causing allele but is in linkage disequilibrium with the real trait causing 

20 allele, then the trait causing allele can be found by sequencing the vicinity of the associated 

marker. 

Another embodiment of the present invention encompasses methods of detecting an 
association between a haplotype and a phenotype, comprising the steps of: a) estimating the 
frequency of at least one haplotype in a trait positive population according to a method of 

25 estimating the frequency of a haplotype of the invention; b) estimating the frequency of said 

haplotype in a control population according to the method of estimating the frequency of a 
haplotype of the invention; and c) determining whether a statistically significant association 
exists between said haplotype and said phenotype. In addition, the methods of detecting an 
association between a haplotype and a phenotype of the invention encompass methods with any 

30 further limitation described in this disclosure, or those following, specified alone or in any 

combination: Optionally, said 13q3 l-q33-related biallelic marker may be in a sequence 
selected individually or in any combination from the group consisting of SEQ ID Nos 1 to 17, 
24, 25, 28, 74 and 79 to 132, and the complements thereof; optionally, said 13q31-q33-related 
biallelic marker may be selected individually or in any combination from the biallelic markers 

35 described in Tables 6b and 6c; optionally, said control population may be a trait negative 
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population, or a random population; optionally, said phenotype is a disease involving 
schizophrenia, a response to an agent acting on schizophrenia, or a side effects to an agent 
acting on schizophrenia. 

Haplotype analysis 

5 As described above, when a chromosome carrying a disease allele first appears in a 

population as a result of either mutation or migration, the mutant allele necessarily resides on a 
chromosome having a set of linked markers: the ancestral haplotype. This haplotype can be 
tracked through populations and its statistical association with a given trait can be analysed. 
Complementing single point (allelic) association studies with multi-point association studies 

10 also called haplotype studies increases the statistical power of association studies. Thus, a 

haplotype association study allows one to define the frequency and the type of the ancestral 
carrier haplotype. A haplotype analysis is important in that it increases the. statistical power of 
an analysts involving individual markers. 

In a first stage of a haplotype frequency analysis, the frequency of the possible 

15 haplotypes based on various combinations of the identified biallelic markers of the invention is 

determined. The haplotype frequency is then compared for distinct populations of trait positive ' 
and control individuals. The number of trait positive individuals, which should be, subjected to 
this analysis to obtain statistically significant results usually ranges between 30 and 300, with a 
preferred number of individuals ranging between 50 and 150. The same considerations apply to 

20 the number of unaffected individuals (or random control) used in the study. The results of this 

first analysis provide haplotype frequencies in case-control populations, for each evaluated 
haplotype frequency a p-value and an odd ratio are calculated. If a statistically significant 
association is found the relative risk for an individual carrying the given haplotype of being 
affected with the trait under study can be approximated* 

25 Interaction Analysis 

The biallelic markers of the present invention may also be used to identify patterns of 
biallelic markers associated with detectable traits resulting from polygenic interactions. The 
analysis of genetic interaction between alleles at unlinked loci requires individual genotyping 
using the techniques described herein. The analysis of allelic interaction among a selected set 

30 of biallelic markers with appropriate level of statistical significance can be considered as a 

haplotype analysis. Interaction analysis comprises stratifying the case-control populations with 
respect to a given haplotype for the first loci and performing a haplotype analysis with the 
second loci with each subpopulation. 

Statistical methods used in association studies are further described herein. 

35 4) Testing for linkage in the presence of association 
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The biallelic markers of the present invention may further be used in TDT 
(transmission/disequilibrium test). TDT tests for both linkage and association and is not 
affected by population stratification. TDT requires data for affected individuals and their 
parents or data from unaffected sibs instead of from parents (see Spielmann S. et al., 1993; 
5 Schaid DJ. et al., 1996, Spielmann S. and Ewens W.J, 1998). Such combined tests generally 

reduce the false - positive errors produced by separate analyses. 

Statistical methods 

In general, any method known in the art to test whether a trait and a genotype show a 
statistically significant correlation may be used. 
10 1) Methods in linkage analysis 

Statistical methods and computer programs useful for linkage analysis are well-known 
to those skilled in the art (see Terwilliger J.D. and Ott J., 1994; Ott J., 1991). 

2) Methods to estimate haplotype frequencies in a population 

As described above, when genotypes are scored, it is often not possible to distinguish 
15 heterozygotes so that haplotype frequencies cannot be easily inferred. When the gametic phase 

is not known, haplotype frequencies can be estimated from the multilocus genotypic data. Any 
method known to person skilled in the art can be used to estimate haplotype frequencies (see 
Lange K. s 1997; Weir, B.S., 1996) Preferably, maximum-likelihood haplotype frequencies are 
computed using an Expectation- Maximization (EM) algorithm (see Dempster et al., 1977; 
20 ExcofFier L. and Slatkin M., 1995). This procedure is an iterative process aiming at obtaining 

maximum-likelihood estimates of haplotype frequencies from multi-locus genotype data when 
the gametic phase is unknown. Haplotype estimations are usually performed by applying the 
EM algorithm using for example the EM-HAPLO program (Hawley M.E. et al., 1994) or the 
Arlequin program (Schneider et al., 1997). The EM algorithm is a generalised iterative 
25 maximum likelihood approach to estimation and is briefly described below. 

In the following part of this text, phenotypes will refer to multi-locus genotypes with 
unknown phase. Genotypes will refer to known-phase multi-locus genotypes. Suppose a 
sample of N unrelated individuals typed for K markers. The data observed are the unknown- 
phase K-locus phenotypes that can categorised in F different phenotypes. Suppose that we have 
30 H underlying possible haplotypes (in case of K biallelic markers, H=2 K ). 

For phenotype j, suppose that cj genotypes are possible. We thus have the following 
equation 

c j c j 
Pj = I pr (genotype i) = Spr(A^ ,h t ) Equation 1 
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where Pj is the probability of the phenotype j, h k and h\ are the two haplotypes constituent the 
genotype Under the Hardy-Weinberg equilibrium, pr(hM becomes : 

pr{h k ,hi) = pr(h k ) 2 if h k = A/,/?r(*A^/) = 2pr(A Jt ). ; pr(A/)if h k * h l ■ Equation 2 
The successive steps of the E-M algorithm can be described as follows: 

5 Starting with initial values of the of haplotypes frequencies, noted p[ 0) , > 

these initial values serve to estimate the genotype frequencies (Expectation step) and then 

estimate another set of haplotype frequencies (Maximisation step), noted p^ 9 p^ , p$ , 

these two steps are iterated until changes in the sets of haplotypes frequency are very small. 
A stop criterion can be that the maximum difference between haplotype frequencies 

1 0 between two iterations is less than 10" 7 . These values can be adjusted according to the desired 

precision of estimations. In details, at a given iteration s, the Expectation step comprises 
calculating the genotypes frequencies by the following equation: 

pr (genotype = pr (phenotype j).pr (genotype i\phenotype j)^ 

_ n ) M**>*/) W Equation 3 

where genotype / occurs in phenotype y, and where hk and hi constitute genotype /. Each 
15 probability is derived according to eq.l, and eq.2 described above. 

Then the Maximisation step simply estimates another set of haplotype frequencies 
given the genotypes frequencies. This approach is also known as gene-counting method (Smith, 
1957). 

P? +1) 4^^ a W ,) Equation 4 

2 y=l/=l 

20 Where 8ft is an indicator variable which count the number of time haplotype t in genotype /. It 

takes the values of 0, 1 or 2. 

To ensure that the estimation finally obtained is the maximum-likelihood estimation 
several values of departures are required. The estimations obtained are compared and if they are 
different the estimations leading to the best likelihood are kept. 
25 3) Methods to calculate linkage disequilibrium between markers 

A number of methods can be used to calculate linkage disequilibrium between any two 
genetic positions, in practice linkage disequilibrium is measured by applying a statistical 
association test to haplotype data taken from a population. Linkage disequilibrium between any 
pair of biallelic markers comprising at least one of the bialleiic markers of the present invention 
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(M is Mj) having alleles (aj/bi) at marker Mj and alleles (a/bj) at marker Mj can be calculated for 
every allele combination (aj,aj . a i? bj ; b^aj and bj,bj), according to the Piazza formula : 
A aiaj = V64 W (94 + 93) (94 +92), where : 

64= - - = frequency of genotypes not having allele aj at M,- and not having allele aj at Mj 
5 93= - + = frequency of genotypes not having allele aj at Mj and having allele aj at Mj 

92= + - = frequency of genotypes having allele aj at M,* and not having allele aj at Mj 
Linkage disequilibrium (LD) between pairs of biallelic markers (M i5 Mj) can also be calculated 
for every allele combination (ai,aj ; ai,bj ; bi,aj and b is bj), according to the maximum-likelihood 
estimate (MLE) for delta (the composite genotypic disequilibrium coefficient), as described by 
10 Weir (Weir B.S., 1996). The MLE for the composite linkage disequilibrium is: 

(2m + n 2 + n 3 + n 4 /2)/N - 2(pr(a0.pr(aj)) 
where n.i = £ phenotype (a/aj, aj/aj), n 2 - £ phenotype (a/a^ a/bj), n3= £ phenotype (a t /b iy aj/aj), 
n4~ £ phenotype (a/b^, a/bj) and N is the number of individuals in the sample. This formula 
allows linkage disequilibrium between alleles to be estimated when only genotype, and not 
15 hapiotype, data are available. 

Another means of calculating the linkage disequilibrium between markers is as follows. 
For a couple of biallelic markers, N£ t (a/b f ) and Mj(a/bj) 9 fitting the Hardy- Weinberg 
equilibrium, one can estimate the four possible hapiotype frequencies in a given population 
according to the approach described above. 
20 The estimation of gametic disequilibrium between ai and aj is simply: 

I>aiaj = pr(haplotype(fii,ajy) - pr(ai)'.pr(a j). 

Where pr(a) is the probability of allele a f and pr(aj) is the probability of allele aj and where 
pr(haplotype (a h a)) is estimated as in Equation 3 above. 

For a couple of biallelic marker only one measure of disequilibrium is necessary to 
25 describe the association between Mi and Mj. 

Then a normalised value of the above is calculated as follows: 

/ max (-pr(ai).pr(aj) , -pr(bj),pr(bj)) with D aiiO <0 
D , a u j «D ataj /max(pr(bi).pr(a j ) 5 pr(aj).pr(bj)) withD aiaj >0 
The skilled person will readily appreciate that other LD calculation methods can be used 
30 without undue experimentation. 

Linkage disequilibrium among a set of biallelic markers having an adequate 
heterozygosity rate can be determined by genotyping between 50 and 1000 unrelated 
individuals, preferably between 75 and 200, more preferably around 100. 
4) Testing for association 
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Methods for determining the statistical significance of a correlation between a 
phenotype and a genotype, in this case an allele at a biallelic marker or a haplotype made up of 
such alleles, may be determined by any statistical test known in the art and with any accepted 
threshold of statistical significance being required. The application of particular methods and 
5 thresholds of significance are well with in the skill of the ordinary practitioner of the art. 

Testing for association is performed by determining the frequency of a biallelic marker 
allele in case and control populations and comparing these frequencies with a statistical test to 
determine if their is a statistically significant difference in frequency which would indicate a 
correlation between the trait and the biallelic marker allele under study. Similarly, a haplotype 

10 analysis is performed by estimating the frequencies of all possible haplotypes for a given set of 

biallelic markers in case and control populations, and comparing these frequencies with a 
statistical test to determine if their is a statistically significant correlation between the haplotype 
and the phenotype (trait) under study. Any statistical tool useful to test for a statistically 
significant association between a genotype and a phenotype may be used. Preferably the 

15 statistical test employed is a chi-square test with one degree of freedom. A P-value is calculated 

(the P-value is the probability that a statistic as large or larger than the observed one would 
occur by chance). 
Statistical significance 

In preferred embodiments, significance for diagnosis purposes, either as a positive basis 

20 for further diagnostic tests or as a preliminary starting point for early preventive therapy, the p 

value related to a biallelic marker association is preferably about 1 x 10" 2 or less, more 
preferably about 1 x 10" 4 or less, for a single biallelic marker analysis and about 1 x 1 0" 3 or less, 
still more preferably 1 x 10" 6 or less and most preferably of about 1 x 10" 8 or less, for a 
haplotype analysis involving several markers. These values are believed to be applicable to any 

25 association studies involving single or multiple marker combinations. 

The skilled person can use the range of values set forth above as a starting point in 
order to carry out association studies with biallelic markers of the present invention. In doing 
so, significant associations between the biallelic markers of the present invention and diseases 
involving schizophrenia can be revealed and used for diagnosis and drug screening purposes. 

30 Phenotypic permutation 

In order to confirm the statistical significance of the first stage haplotype analysis 
described above, it might be suitable to perform further analyses in which genotyping data from 
case-control individuals are pooled and randomised with respect to the trait phenotype. Each 
individual genotyping data is randomly allocated to two groups, which contain the same number 

35 of individuals as the case-control populations used to compile the data obtained in the first 
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stage. A second stage haplotype analysis is preferably run on these artificial groups, preferably 
for the markers included in the haplotype of the first stage analysis showing the highest relative 
risk coefficient. This experiment is reiterated preferably at least between 100 and 10000 times. 
The repeated iterations allow the determination of the percentage of obtained haplotypes with a 
5 significant p-value level. 

Assessment of statistical association 

To address the problem of false positives similar analysis may be performed with the 
same case-control populations in random genomic regions. Results in random regions and the 
candidate region are compared as described in US Provisional Patent Application entitled 
10 "Methods, software and apparati for identifying genomic regions harbouring a gene associated 

with a detectable trait". 
5) Evaluation of risk factors 

The association between a risk factor (in genetic epidemiology the risk factor is the 
presence or the absence of a certain allele or haplotype at marker loci) and a disease is measured 
15 by the odds ratio (OR) and by the relative risk (RR). If P(R + ) is the probability of developing 

the disease for individuals with R and P(R") is the probability for individuals without the risk 
factor, then the relative risk is simply the ratio of the two probabilities, that is: 

RR= P(R + )/P(R") 

In case-control studies, direct measures of the relative risk cannot be obtained because of the 
20 sampling design. However, the odds ratio allows a good approximation of the relative risk for 

low-incidence diseases and can be calculated: 



F* ' 




F~ 


1-F + 


/ 





F + is the frequency of the exposure to the risk factor in cases and F" is the frequency of the 
exposure to the risk factor in controls. F + and F" are calculated using the allelic or haplotype 
25 frequencies of the study and further depend on the underlying genetic model (dominant, 

recessive, additive...). 

One can further estimate the attributable risk (AR) which describes the proportion of individuals 
in a population exhibiting a trait due to a given risk factor. This measure is important in 
quantitating the role of a specific factor in disease etiology and in terms of the public health 
30 impact of a risk factor. The public health relevance of this measure lies in estimating the 

proportion of cases of disease in the population that could be prevented if the exposure of 
interest were absent. AR is determined as follows: 

AR = P E (RR-1)/ (P E (RR-1)+1) 
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AR is the risk attributable to a biallelic marker allele or a biallelic marker haplotype. P E is the 
frequency of exposure to an allele or a haplotype within the population at large; and RR is the 
relative risk which, is approximated with the odds ratio when the trait under study has a 
relatively low incidence in the general population. 
5 AR is the risk attributable to a biallelic marker allele or a biallelic marker haplotype, 

P£ is the frequency of exposure to an allele or a haplotype within the population at large; and 
RR is the relative risk which, is approximated with the odds ratio when the trait under study has 
a relatively low incidence in the general population. 



10 Association of Biallelic Markers of the Invention with Schizophrenia 

Tn the context of the present invention, an association between chromosome 1 3 q3 1 -q3 3 - 
related biallelic markers and schizophrenia and bipolar disorder were established. Several 
association studies using different populations and screening samples thereof, and with different 
sets of biallelic markers distributed on the chromosome 13q3 1 -q33 region were carried out. 
1 5 Further details concerning these association studies and the results are provided herein in 

Examples 5a to 5e. 

This information is extremely valuable. The knowledge of a potential genetic 
predisposition to schizophrenia, even if this predisposition is not absolute, might contribute in a 
very significant manner to treatment efficacy of schizophrenia and to the development of new 
20 . therapeutic and diagnostic tools. 

Identification Of Biallelic Markers In Linkage Disequilibrium With The Biallelic 
Markers of the Invention 

Once a first biallelic marker has been identified in a genomic region of interest, the 
25 practitioner of ordinary skill in the art, using the teachings of the present invention, can easily 

identify additional biallelic markers in linkage disequilibrium with this first marker. As 
mentioned before, any marker in linkage disequilibrium with a first marker associated with a 
trait will be associated with the trait. Therefore, once an association has been demonstrated 
between a given biallelic marker and a trait, the discovery of additional biallelic markers 
30 associated with this trait is of great interest in order to increase the density of biallelic markers 

in this particular region. The causal gene or mutation will be found in the vicinity of the marker 
or set of markers showing the highest correlation with the trait. 

Identification of additional markers in linkage disequilibrium with a given marker 
involves: (a) amplifying a genomic fragment comprising a first biallelic marker from a plurality 
35 of individuals; (b) identifying of second biallelic markers in the genomic region harboring said 
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first biallelic marker; (c) conducting a linkage disequilibrium analysis between said first 
biallelic marker and second biallelic markers; and (d) selecting said second biallelic markers as 
being in linkage disequilibrium with said first marker. Subcombinations comprising steps (b) 
and (c) are also contemplated. 
5 Methods to identify biallelic markers and to conduct linkage disequilibrium analysis are 

described herein and can be carried out by the skilled person without undue experimentation. 
The present invention then also concerns biallelic markers and other polymorphisms which are 
in linkage disequilibrium with the specific biallelic markers of the invention and which are 
expected to present similar characteristics in terms of their respective association with a given 
10 trait. In a preferred embodiment, the invnetion concerns biallelic markers which are in linkage 

disequilibrium with the specific biallelic markers. 

Identification Of Functional Mutations 

Once a positive association is confirmed with a biallelic marker of the present 

15 invention, the associated candidate gene sequence can be scanned for mutations by comparing 

the sequences of a selected number of trait positive and trait negative individuals. In a preferred 
embodiment, functional regions such as exons and splice sites, promoters and other regulatory 
regions of the gene are scanned for mutations. Preferably, trait positive individuals carry the 
haplotype shown to be associated with the trait and trait negative individuals do not carry the 

20 haplotype or allele associated with the trait. The mutation detection procedure is essentially 

similar to that used for biallelic site identification. 

The method used to detect such mutations generally comprises the following steps: (a) 
amplification of a region of the candidate DNA sequence comprising a biallelic marker or a 
group of biallelic markers associated with the trait from DNA samples of trait positive patients 

25 and trait negative controls; (b) sequencing of the amplified region; (c) comparison of DNA 

sequences from trait-positive patients and trait-negative controls; and (d) determination of 
mutations specific to trait-positive patients. Subcombinations which comprise steps (b) and (c) 
are specifically contemplated. 

It is preferred that candidate polymorphisms be then verified by screening a larger 

30 population of cases and controls by means of any genotyping procedure such as those described 

herein, preferably using a microsequencing technique in an individual test format. 
Polymorphisms are considered as candidate mutations when present in cases and controls at 
frequencies compatible with the expected association results. 

Candidate polymorphisms and mutations of the g35030 nucleic acid sequences 

35 suspected of being involved in a predisposition to schizophrenia can be confirmed by screening 
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a larger population of affected and unaffected individuals using any of the genotyping 
procedures described herein. Preferably the microsequencing technique is used. Such 
polymorphisms are considered as candidate "trait-causing" mutations when they exhibit a 
statistically significant correlation with the detectable phenotype. 

5 

Biallelic Markers Of The Invention In Methods Of Genetic Diagnostics 
The biallelic markers and other polymorphisms of the present invention can also be 
used to develop diagnostics tests capable of identifying individuals who express a detectable 
trait as the result of a specific genotype or individuals whose genotype places them at risk of 

10 developing a detectable trait at a subsequent time. The trait analyzed using the present 

diagnostics may be any detectable trait, including predisposition to schizophrenia, age of onset 
of detectable symptoms, a beneficial response to or side effects related to treatment against 
schizophrenia. Such a diganosis can be useful in the monitoring, prognosis and/or prophylactic 
or curative therapy for schizophrenia. 

15 The diagnostic techniques of the present invention may employ a variety of 

methodologies to determine whether a test subject has a genotype associated with an increased 
risk of developing a detectable trait or whether the individual suffers from a detectable trait as a 
result of a particular mutation, including methods which enable the analysis of individual 
chromosomes for haplotyping, such as family studies, single sperm UNA analysis or somatic 

20 hybrids. 

The diagnostic techniques concern the detection of specific alleles present within the 
human chromosome 13q31-q33 region and optionally within a g35030 nucleic acid sequence. 
More particularly, the invention concerns the detection of a nucleic acid comprising at least one 
of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or a fragment 
25 thereof or a complementary sequence thereto including the polymorphic base. 

These methods involve obtaining a nucleic acid sample from the individual and, 
determining, whether the nucleic acid sample contains at least one allele or at least one biallelic 
marker haplotype, indicative of a risk of developing the trait or indicative that the individual 
expresses the trait as a result of possessing a particular the human chromosome 1 3q3 1 -q33- or 
30 g35O30-related polymorphism or mutation (trait-causing allele). 

Preferably, in such diagnostic methods, a nucleic acid sample is obtained from the 
individual and this sample is genotyped using methods described above in "Methods Of 
Genotyping DNA Samples For Biallelic markers." The diagnostics may be based on a single 
biallelic marker or a on group of biallelic markers. 
35 In each of these methods, a nucleic acid sample is obtained from the test subject and the 



WO 01/40493 PCT/IBOO/01507 

100 

biallelic marker pattern of one or more of the biallelic markers of the invention is determined. 

In one embodiment, a PCR amplification is conducted on the nucleic acid sample to 
amplify regions in which polymorphisms associated with a detectable phenotype have been 
identified. The amplification products are sequenced to determine whether the individual 
5 possesses one or more human chromosome 1 3q3 l~q33 region- or g3503O-related 

polymorphisms associated with a detectable phenotype. The primers used to generate 
amplification products may comprise the primers listed in Table 4. Alternatively, the nucleic 
acid sample is subjected to microsequencing reactions as described above to determine whether 
the individual possesses one or more human chromosome 13q31-q33 region-related 

10 polymorphisms associated with a detectable phenotype resulting from a mutation or a 

polymorphism in the human chromosome 13q31-q33 region or g35030-related biallelic marker. 
The primers used in the microsequencing reactions may include the primers listed in 6. In 
another embodiment, the nucleic acid sample is contacted with one or more allele specific 
oligonucleotide probes which, specifically hybridize to one or more human chromosome 13q31- 

15 q33 region or g35030-related alleles associated with a detectable phenotype. The probes used 

in the hybridization assay may include the probes listed in Table 5. In another embodiment, the 
nucleic acid sample is contacted with a second oligonucleotide capable of producing an 
amplification product when used with the allele specific oligonucleotide in an amplification 
reaction. The presence of an amplification product in the amplification reaction indicates that 

20 the individual possesses one or more human chromosome 13q31-q33 region or g35030-related 

alleles associated with a detectable phenotype. 

In a preferred embodiment the identity of the nucleotide present at, at least one, biallelic 
marker selected from the group consisting of Al 3 to Al 8, A20 to A46, A49 to A52, A55, A57, 
A59 to A63, A72 to A73, A76 and A123 and the complements thereof, is determined and the 

25 detectable trait is schizophrenia. Diagnostic kits comprise any of the polynucleotides of the 

present invention. 

These diagnostic methods are extremely valuable as they can, in certain circumstances, 
be used to initiate preventive treatments or to allow an individual carrying a significant 
haplotype to foresee warning signs such as minor symptoms. 

30 Diagnostics, which analyze and predict response to a drug or side effects to a drug, may 

be used to determine whether an individual should be treated with a particular drug. For 
example, if the diagnostic indicates a likelihood that an individual will respond positively to 
treatment with a particular drug, the drug may be administered to the individual. Conversely, if 
the diagnostic indicates that an individual is likely to respond negatively to treatment with a 

35 particular drug, an alternative course of treatment may be prescribed. A negative response may 
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be defined as either the absence of an efficacious response or the presence of toxic side effects. 

Clinical drug trials represent another application for the markers of the present 
invention. One or more markers indicative of response to an agent acting against schizophrenia 
or to side effects to an agent acting against schizophrenia may be identified using the methods 
5 described above. Thereafter, potential participants in clinical trials of such an agent may be 

screened to identify those individuals most likely to respond favorably to the drug and exclude 
those likely to experience side effects. In that way, the effectiveness of drug treatment may be 
measured in individuals who respond positively to the drug, without lowering the measurement 
as a result of the inclusion of individuals who are unlikely to respond positively in the study and 
10 without risking undesirable safety problems. 

PREVENTION, DIAGNOSIS AND TREATMENT OF PSYCHIATRIC DISEASE 

An aspect of the present invention relates to the preparation of a medicament for the 
treatment of psychiatric disease, in particular schizophrenia and bipolar disorder. The present 

15 invention embodies medicaments acting on g35030. 

In preferred embodiments, medicaments of the invention act on g35030, either directly or 
indirectly, by acting on the g35030 pathways. For example, the medicaments may modulate, and 
more preferably decrease the level of g35030 activity which occurs in a cell or particular tissue, or 
increase or descrease the activity of the g35030 protein. In certain embodiments, the invention 

20 thus comprises use of a compound capable of increasing or decreasing g35030 expression or 

g35030 protein activity in the preparation or manufacture of a medicament. Preferably, said 
compound is used for the treatment of a psychiatric disease, preferably for the treatment of 
schizophrenia or bipolar disorder. Preferably, said compound acts directly by binding to g35030 
ofag35030 receptor. 

25 Such medicaments may also increase or decrease the activity of a compound analogous to 

g35O30, a compound comprising an amino acid sequence having at least 25% homology to a 
sequence selected from the group consisting of SEQ ID NOs. 1 8 to 23, a compound comprising an 
amino acid sequence having at least 50% homology to a sequence selected from the group 
consisting of SEQ ID NOs. 18 to 23, and a compound comprising an amino acid sequence having 

30 at least 80% homology to a sequence selected from the group consisting of SEQ ID NOs. 1 8 to 

23. 

Medicaments which increase or descrease the activity of these compounds in an 
individual may be used to ameliorate or prevent symptoms in individuals suffering from or 
predisposed to a psychiatric disease, as discussed above in the section entitled "indications". 
35 Alternatively, g35O30 activity may be increased or decreased by the expression of the 
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genes encoding the identified g35030-modulating compounds using gene therapy. Examples of 
vectors and promoters suitable for use in gene therapy are described above. G35030 activity may 
also be increased or decreased by preparing an antibody which binds to a g35030 peptide, a 
g35030 receptor or a protein related thereto, as well as fragments of these proteins. Such 
5 antibodies may modulate the interaction between g35O30 and a g35030 receptor or a protein 

related thereto. Antibodies and methods of obtaining them are further described herein. 

As described above, the present invention provides cellular assays for identifying 
compounds for the treatment of psychiatric disease. The assays are based on detection of g35030 
expression, measurement of g3503O protein activity, or based on the determination of other 

10 suitable. schizophrenia, bipolar disorder or related psychiatric disease endpoints. Compounds for 

the treatment of psychiatric disease include derivative proteins or peptides which are capable of 
inhibiting the activity of a wild type g35030 protein, which may be identified by determining their 
ability to bind a wild type g35030 protein. Compounds also include antibodies, and small 
molecules and drugs which may be obtained using a variety of synthetic approaches familiar to 

i 5 those skilled in the art, including combinatorial chemistry based techniques. 

The invention further encompasses said methods for the prevention, treatment, and 
diagnosis of disease using any of the g35030 nucleic acids of proteins of the invention in 
analogous methods. 

G35030 in Methods of Diagnosis or Detecting Predisposition 

20 Individuals affected by or predisposed to schizophrenia and bipolar disorder may express 

abnormal levels of g35030. Individuals having increased or decreased g35030 activity in their 
plasma, body fluids, or body tissues may be at risk of devloping schizophrenia, bipolar disorder or 
a variety of potentially related psychiatric conditions. In one aspect of the present invention is a 
method for determining whether an individual is at risk of suffering from or is currently suffering 

25 from schizophrenia, bipolar disorder or other psychotic disorders, mood disorders, autism, 

substance dependence or alcoholism, mental retardation, or other psychiatric diseases including 
cognitive, anxiety, eating, impulse-control, and personality disorders, as defined with the 
Diagnosis and Statistical Manual of Mental Disorders fourth edition (DSM-IV) classification, 
comprising determining whether the individual has an abnormal level of g35O30 activity in 

30 plasma, body fluids, or body tissues. The level of g35030 or analogous compounds in plasma, 

body fluids, or body tissues may be determined using a variety approaches. In particular, the level 
may be determined using EL1S A, Western Blots, or protein electrophoresis. 

Biallelic Markers Of The Invention In Methods Of Genetic Diagnostics 

The biallelic markers and other polymorphisms of the present invention can also be 

35 used to develop diagnostics tests capable of identifying individuals who express a detectable 
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trait as the result of a specific genotype or individuals whose genotype places them at risk of 
developing a detectable trait at a subsequent time. The trait analyzed using the present 
diagnostics may be used to diagnose any detectable trait, including predisposition to 
schizophrenia or bipolar disorder, age of onset of detectable symptoms, a beneficial response to 
5 or side effects related to treatment against schizophrenia or bipolar disorder. Such a diagnosis 

can be useful in the monitoring, prognosis and/or prophylactic or curative therapy for 
schizophrenia or bipolar disorder. 

The diagnostic techniques of the present invention may employ a variety of 
methodologies to determine whether a test subject has a genotype associated with an increased 
10 risk of developing a detectable trait or whether the individual suffers from a detectable trait as a 

result of a particular mutation, including methods which enable the analysis of individual 
chromosomes for haplotyping, such as family studies, single sperm DNA analysis or somatic 
hybrids. 

The diagnostic techniques concern the detection of specific alleles present within the 

15 human chromosome 13q3 l-q33 region, and optionally within a g35030 nucleic acid sequence. 

More particularly, the invention concerns the detection of a nucleic acid comprising at least one 
of the nucleotide sequences of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or a fragment 
thereof or a complementary sequence thereto including the polymorphic base. 

These methods involve obtaining a nucleic acid sample from the individual and, 

20 determining, whether the nucleic acid sample contains at least one allele or at least one bialielic 

marker haplotype, indicative of a risk of developing the trait or indicative that the individual 
expresses the trait as a result of possessing a particular the human chromosome 13q3 l-q33 
region-related polymorphism or mutation (trait-causing allele). 

Preferably, in such diagnostic methods, a nucleic acid sample is obtained from the 

25 individual and this sample is genotyped using methods described above in "Methods Of 

Genotyping DNA Samples For Bialielic markers." The diagnostics may be based on a single 
bialielic marker or a on group of bialielic markers. 

In each of these methods, a nucleic acid sample is obtained from the test subject and the 
bialielic marker pattern of one or more of a bialielic marker of the invention is determined. 

30 In one embodiment, a PCR amplification is conducted on the nucleic acid sample to 

amplify regions in which polymorphisms associated with a detectable phenotype have been 
identified. The amplification products are sequenced to determine whether the individual 
possesses one or more human chromosome 13q31-q33 region, g35030-related polymorphisms 
associated with a detectable phenotype. The primers used to generate amplification products 

35 may comprise the primers listed in Table 4. Alternatively, the nucleic acid sample is subjected 
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to microsequencing reactions as described above to determine whether the individual possesses 
one or more human chromosome 13q31-q33 region, g3 503 0-related polymorphisms associated 
with a detectable phenotype resulting from a mutation or a polymorphism in the human 
chromosome 13q3 1 -q33 region. The primers used in the microsequencing reactions may 
5 include the primers listed in Table 6. In another embodiment, the nucleic acid sample is 

contacted with one or more allele specific oligonucleotide probes which, specifically hybridize 
to one or more human chromosome 13q31-q33 region, g3 5 03 0-related alleles associated with a 
detectable phenotype. The probes used in the hybridization assay may include the probes listed 
in 6b. In another embodiment, the nucleic acid sample is contacted with a second 

10 oligonucleotide capable of producing an amplification product when used with the allele 

specific oligonucleotide in an amplification reaction. The presence of an amplification product 
in the amplification reaction indicates that the individual possesses one or more human 
chromosome 13q31-q33 region, g3 5 03 0-related alleles associated with a detectable phenotype. 
In a preferred embodiment, the detectable trait is schizophrenia or bipolar disorder. Diagnostic 

15 kits comprise any of the polynucleotides of the present invention. 

These diagnostic methods are extremely valuable as they can, in certain circumstances, 
be used to initiate preventive treatments or to allow an individual carrying a significant 
haplotype to foresee warning signs such as minor symptoms. 

Diagnostics, which analyze and predict response to a drug or side effects to a drug, may 

20 be used to determine whether an individual should be treated with a particular drug. For 

example, if the diagnostic indicates a likelihood that an individual will respond positively to 
treatment with a particular drug, the drug may be administered to the individual. Conversely, if 
the diagnostic indicates that an individual is likely to respond negatively to treatment with a 
particular drug, an alternative course of treatment may be prescribed. A negative response may 

25 be defined as either the absence of an efficacious response or the presence of toxic side effects. 

Clinical drug trials represent another application for the markers of the present^ 
invention. One or more markers indicative of response to an agent acting against schizophrenia 
or to side effects to an agent acting against schizophrenia may be identified using the methods 
described above. Thereafter, potential participants in clinical trials of such an agent may be 

30 screened to identify those individuals most likely to respond favorably to the drug and exclude 

those likely to experience side effects. In that way, the effectiveness of drug treatment may be 
measured in individuals who respond positively to the drug, without lowering the measurement 
as a result of the inclusion of individuals who are unlikely to respond positively in the study and 
without risking undesirable safety problems. 

35 Prevention And Treatment Of Disease Using Biallelic Markers 
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In large part because of the risk of suicide, the detection of susceptibility to 
schizophrenia, bipolar disorder as well as other psychiatric disease in individuals is very 
important. Consequently, the invention concerns a method for the treatment of schizophrenia or 
bipolar disorder, or a related disorder comprising the following steps: 
5 - selecting an individual whose DNA comprises alleles of a biallelic marker or of a group of 

biallelic markers of the human chromosome 13q3 l-q33 region, preferably g35030-related 
markers associated with schizophrenia or bipolar disorder, 

- following up said individual for the appearance (and optionally the development) of 
the symptoms related to schizophrenia or bipolar disorder; and 

1 0 - administering a treatment acting against schizophrenia or bipolar disorder or against symptoms . 
thereof to said individual at an appropriate stage of the disease. 

Another embodiment of the present invention comprises a method for the treatment of 
schizophrenia or bipolar disorder comprising the following steps: 

- selecting an individual whose DNA comprises alleles of a biallelic marker or of a 

1 5 group of biallelic markers, of the human chromosome 13q3 l-q33 region, and more preferably 

g35030-related markers associated with schizophrenia or bipolar disorder; 
- administering a preventive treatment of schizophrenia or bipolar disorder to said individual. 

In a further embodiment, the present invention concerns a method for the treatment of 
schizophrenia or bipolar disorder comprising the following steps: 
20 - selecting an individual whose DNA comprises alleles of a biallelic marker or of a 

group of biallelic markers of the human chromosome 13q31-q33, and more preferably g3503O- 
related markers associated with schizophrenia or bipolar disorder; 

- administering a preventive treatment of schizophrenia or bipolar disorder to said 
individual; 

25 - following up said individual for the appearance and the development of schizophrenia 

or bipolar disorder symptoms; and optionally 

- administering a treatment acting against schizophrenia or bipolar disorder or against 
symptoms thereof to said individual at the appropriate stage of the disease. 

For use in the determination of the course of treatment of an individual suffering from 
30 disease, the present invention also concerns a method for the treatment of schizophrenia or 

bipolar disorder comprising the following steps: 

- selecting an individual suffering from schizophrenia or bipolar disorder whose DNA 
comprises alleles of a biallelic marker or of a group of biallelic markers of the human 
chromosome 13q3 l-q33 region, and preferably g35030-related markers, associated with the 

35 gravity of schizophrenia or bipolar disorder or of the symptoms thereof; and 
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- administering a treatment acting against schizophrenia or bipolar disorder or 
symptoms thereof to said individual. 

The invention also concerns a method for the treatment of schizophrenia or bipolar 
disorder in a selected population of individuals. The method comprises: 
5 - selecting an individual suffering from schizophrenia or bipolar disorder and whose 

DNA comprises alleles of a biallelic marker or of a group of biallelic markers of the human 
chromosome 13q3 l-q33 region, and more preferably g35O30-related markers associated with a 
positive response to treatment with an effective amount of a medicament acting against 
schizophrenia or bipolar disorder or symptoms thereof, 
10 - and/or whose DNA does not comprise alleles of a biallelic marker or of a group of 

biallelic markers of the human chromosome 13q3 l-q33 region, and more preferably g35030- 
related markers associated with a negative response to treatment with said medicament; and 

- administering at suitable intervals an effective amount of said medicament to said 
selected individual. 

15 In the context of the present invention, a "positive response" to a medicament can be 

defined as comprising a reduction of the symptoms related to the disease. In the context of the 
present invention, a "negative response** to a medicament can be defined as comprising either a 
lack of positive response to the medicament which does not lead to a symptom reduction or 
which leads to a side^effect observed following administration of the medicament. 

20 The invention also relates to a method of determining whether a subject is likely to 

respond positively to treatment with a medicament. The method comprises identifying a first 
population of individuals who respond positively to said medicament and a second population 
of individuals who respond negatively to said medicament. One or more biallelic markers is 
identified in the first population which is associated with a positive response to said medicament 

25 or one or more biallelic markers is identified in the second population which is associated with a 

negative response to said medicament. The biallelic markers may be identified using the 
techniques described herein, 

A DNA sample is then obtained from the subject to be tested. The DNA sample is 
analyzed to determine whether it comprises alleles of one or more biallelic markers associated 

30 with a positive response to treatment with the medicament and/or alleles of one or more biallelic 

markers associated with a negative response to treatment with the medicament. 

In some embodiments, the medicament may be administered to the subject in a clinical 
trial if the DNA sample contains alleles of one or more biallelic markers associated with a 
positive response to treatment with the medicament and/or if the DNA sample lacks alleles of 

35 one or more biallelic markers associated with a negative response to treatment with the 
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medicament. In preferred embodiments, the medicament is a drug acting against schizophrenia 
or bipolar disorder. 

Using the method of the present invention, the evaluation of drug efficacy may be 
conducted in a population of individuals likely to respond favorably to the medicament. 
5 Another aspect of the invention is a method of using a medicament comprising 

obtaining a DNA sample from a subject, determining whether the DNA sample contains alleles 
of one or more biallelic markers associated with a positive response to the medicament and/or 
whether the DNA sample contains alleles of one or more biallelic markers associated with a 
negative response to the medicament, and administering the medicament to the subject if the 
10 DNA sample contains alleles of one or more biallelic markers associated with a positive 

response to the medicament and/or if the DNA sample lacks alleles of one or more biallelic 
markers associated with a negative response to the medicament 

The invention also concerns a method for the clinical testing of a medicament, 
preferably a medicament acting against schizophrenia or or bipolar disorder or symptoms 
15 thereof. The method comprises the following steps: 

- administering a medicament, preferably a medicament susceptible of acting against 
schizophrenia or or bipolar disorder or symptoms thereof to a heterogeneous population of 
individuals, 

- identifying a first population of individuals who respond positively to said 
20 medicament and a second population of individuals who respond negatively to said 

medicament, 

- identifying biallelic markers in said first population which are associated with a 
positive response to said medicament, 

- selecting individuals whose DNA comprises biallelic markers associated with a 
25 positive response to said medicament, and 

- administering said medicament to said individuals. 

In any of the methods for the prevention, diagnosis and treatment of schizophrenia and 
bipolar disorder, including methods of using a medicament, clinical testing of a medicament, 
determining whether a subject is likely to respond positively to treatment with a medicament, 
30 said biallelic marker may optionally comprise: 

(a) a biallelic marker selected from the group consisting of g3 503 0-related markers A13 
to A65; (b) a biallelic marker selected from the group consisting of chromosome 13q31-q33- 
related-biallelic markers A13 to Al 8, A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to 
A73, A76 and A123 ; or (c) a biallelic marker selected from the group consisting of 
35 chromosome 13q3 l-q33-related-biallelic markers A13 to A18, A20 to A47, A49 to A52, A55, 
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A57, A59 to A63. Furthermore, any of the embodiments of the invention may specifically 
exclude one or more of the biallelic markers selected from biallelic markers A 1 to A 127. 

Such methods are deemed to be extremely useful to increase the benefit/risk ratio 
resulting from the administration of medicaments which may cause undesirable side effects 
5 and/or be inefficacious to a portion of the patient population to which it is normally 

administered. 

Once an individual has been diagnosed as suffering from schizophrenia or bipolar 
disorder, selection tests are carried out to determine whether the DNA of this individual 
comprises alleles of a biallelic marker or of a group of biallelic markers associated with a 

10 positive response to treatment or with a negative response to treatment which may include either 

side effects or unresponsiveness. 

The selection of the patient to be treated using the method of the present invention can 
be carried out through the detection methods described above. The individuals which are to be 
selected are preferably those whose DNA does not comprise alleles of a biallelic marker or of a 

15 group of biallelic markers associated with a negative response to treatment. The knowledge of 

an individual's genetic predisposition to unresponsiveness or side effects to particular 
medicaments allows the clinician to direct treatment toward appropriate drugs against 
schizophrenia or bipolar disorder or symptoms thereof. 

Once the patient's genetic predispositions have been determined, the clinician can select 

20 appropriate treatment for which negative response, particularly side effects, has not been 

reported or has been reported only marginally for the patient. 

The biallelic markers of the invention have demonstrated an association with 
schizophrenia and bipolar disorders. However, the present invention also comprises any of the 
prevention, diagnostic, prognosis and treatment methods described herein using the biallelic 

25 markers of the invention in methods of preventing, diagnosing, managing and treating related 

disorders, particularly related CNS disorders. By way of example, related disorders may comprise 
psychotic disorders, mood disorders, autism, substance dependence and alcoholism, mental 
retardation, and other psychiatric diseases including cognitive, anxiety, eating, impulse-control, 
and personality disorders, as defined with the Diagnosis and Statistical Manual of Mental 

30 Disorders fourth edition (DSM-IV) classification". 

Recombinant Vectors 

The term "vector" is used herein to designate either a circular or a linear DNA or RNA 
molecule, which is either double-stranded or single-stranded, and which comprise at least one 
polynucleotide of interest that is sought to be transferred in a cell host or in a unicellular or 
35 multicellular host organism. 
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The present invention encompasses a family of recombinant vectors that comprise a 
polynucleotide derived from a g35030 nucleic acid sequence. Consequently, the present 
invention further comprises recombinant vectors comprising: g35030 genomic DNA or cDNAs 
comprised in the nucleic acids of any of nucleotide positions 201 123 to 201234, 201 123 to 
5 201560, 214676 to 214793 7 215702 to 215746, 2 16836 to 216994, 216836 to 217077, 217671 

to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 240440 to 24 1 1 53, 
241072 to 241291, 244353 to 244561, 246273 to 247802,201123 to 247802, 199122 to 
201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No, 1, SEQ ID Nos 2 to 17 and 
primate g3 5030 DNAs of SEQ ID Nos 79 to 1 32, and the complements thereof; 

10 Generally, a recombinant vector of the invention may comprise any of the 

polynucleotides described herein, as well as any g35030 primer or probe as defined above. 

In a first preferred embodiment, a recombinant vector of the invention is used to 
amplify the inserted polynucleotide derived from a g35030 genomic sequence or cDNA of the 
invention in a suitable cell host, this polynucleotide being amplified at every time that the 

15 recombinant vector replicates. 

A second preferred embodiment of the recombinant vectors according to the invention 
comprises expression vectors comprising either a regulatory polynucleotide or a coding nucleic 
acid of the invention, or both. Within certain embodiments, expression vectors are employed to 
express a g35030 polypeptide which can be then purified and, for example be used in ligand 

20 screening assays or as an immunogen in order to raise specific antibodies directed against a 

g35030 protein. In other embodiments, the expression vectors are used for constructing 
transgenic animals and also for gene therapy. Expression requires that appropriate signals are 
provided in the vectors, said signals including various regulatory elements, such as 
enhancers/promoters from both viral and mammalian sources that drive expression of the genes 

25 of interest in host cells. Dominant drug selection markers for establishing permanent, stable cell 

clones expressing the products are generally included in the expression vectors of the invention, 
as they are elements that link expression of the drug selection markers to expression of the 
polypeptide. 

More particularly, the present invention relates to expression vectors which include 
30 nucleic acids encoding a g35030 protein or variants or fragments thereof, under the control of a 

regulatory sequence of the respective g35030 regulatory polynucleotides, or alternatively under 
the control of an exogenous regulatory sequence. 

The invention also pertains to a recombinant expression vector useful for the expression 
of a g35030 cDNA sequence. 
35 Recombinant vectors comprising a nucleic acid containing a human chromosome 
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13q31-33-related biailelic marker, or more preferably a g35030-related bialielic marker is also 
part of the invention. In a preferred embodiment, said biailelic marker is selected from the 
group consisting of A13 to A65 and the complements thereof. 

Some of the elements which can be found in the vectors of the present invention are 
5 described in further detail in the following sections. 

1, General features of the expression vectors of the invention 

A recombinant vector according to the invention comprises, but is not limited to, a 
YAC (Yeast Artificial Chromosome), a BAC (Bacterial Artificial Chromosome), a phage, a 
phagemid, a cosmid, a plasmid or even a linear DNA molecule which may comprise a 
10 chromosomal, non-chromosomal, semi-synthetic and synthetic DNA. Such a recombinant 

vector can comprise a transcriptional unit comprising an assembly of: 

(l) a genetic element or elements having a regulatory role in gene expression, for 
example promoters or enhancers. Enhancers are cis-acting elements of DNA, usually from 
about 10 to 300 bp in length that act on the promoter to increase the transcription. 
15 (2) a structural or coding sequence which is transcribed into mRNA and eventually 

translated into a polypeptide, said structural or coding sequence being operably linked to the 
regulatory elements described in (1); and 

(3) appropriate transcription initiation and termination sequences. Structural units 
intended for use in yeast or eukaryotic expression systems preferably include a leader sequence 
20 enabling extracellular secretion of translated protein by a host cell. Alternatively, when a 

recombinant protein is expressed without a leader or transport sequence, it may include a N- 
terminal residue. This residue may or may not be subsequently cleaved from the expressed 
recombinant protein to provide a final product. 

Generally, recombinant expression vectors will include origins of replication, selectable 
25 markers permitting transformation of the host cell, and a promoter derived from a highly 

expressed gene to direct transcription of a downstream structural sequence. The heterologous 
structural sequence is assembled in appropriate phase with translation initiation and termination 
sequences, and preferably a leader sequence capable of directing secretion of the translated 
protein into the periplasmic space or the extracellular medium. In a specific embodiment 
3D wherein the vector is adapted for transfecting and expressing desired sequences in mammalian 

host cells, preferred vectors will comprise an origin of replication in the desired host, a suitable 
promoter and enhancer, and also any necessary ribosome binding sites, polyadenylation site, 
splice donor and acceptor sites, transcriptional termination sequences, and 5 -flanking non- 
transcribed sequences. DNA sequences derived from the SV40 viral genome, for example 
35 SV40 origin, early promoter, enhancer, splice and polyadenylation sites may be used to provide 
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the required non-transcribed genetic elements. 

The in vivo expression of a g35030 polypeptide or fragments or variants thereof may be 
useful in order to correct a genetic defect related to the expression of the native gene in a host 
organism or to the production of a biologically inactive g3S030 protein. 
5 Consequently, the present invention also comprises recombinant expression vectors 

mainly designed for the in vivo production of the g35030 polypeptide by the introduction of the 
appropriate genetic material in the organism of the patient to be treated. In preferred 
embodiments, said genetic material comprises at least one nucleotide sequence selected from 
the group of nucleotide posittion ranges consisting of: g3503O genomic DNA or cDNAs 

1 0 comprised in the nucleic acids of any of nucleotide positions 201 123 to 201 234, 20 1 1 23 to 

201 560, 2 14676 to 2 1 4793, 2 1 5702 to 2 1 5746, 2 16836 to 21 6994, 2 1 6836 to 2 1 7077, 2 1 767 1 
to 217764, 227655 to 227736, 23871 5 to 23893 9, 240440 to 240673, 240440 to 241 153, 
241072 to 241291, 244353 to 244561, 246273 to 247802, 201 123 to 247802, 199122 to 
201 122, 247803 to 249803 and 199122 to 249803 of SEQ ID No. 1, SEQ ID Nos 2 to 17 and 

1 5 primate g3 5030 DNAs of SEQ ID Nos 79 to 1 32, and the complements thereof; 

This genetic material may be introduced in vitro in a cell that has been previously 
extracted from the organism, the modified cell being subsequently reintroduced in the said 
organism, directly in vivo into the appropriate tissue. 
2. Regulatory Elements 

20 Promoters 

The suitable promoter regions used in the expression vectors according to the present 
invention are chosen taking into account the cell host in which the heterologous gene has to be 
expressed. The particular promoter employed to control the expression of a nucleic acid 
sequence of interest is not believed to be important, so long as it is capable of directing the 

25 expression of the nucleic acid in the targeted cell. Thus, where a human cell is targeted, it is 

preferable to position the nucleic acid coding region adjacent to and under the control of a 
promoter that is capable of being expressed in a human cell, such as, for example, a human or a 
viral promoter. 

A suitable promoter may be heterologous with respect to the nucleic acid for which it 
30 controls the expression or alternatively can be endogenous to the native polynucleotide 

containing the coding sequence to be expressed. Additionally, the promoter is generally 
heterologous with respect to the recombinant vector sequences within which the construct 
promoter/coding sequence has been inserted. 

Promoter regions can be selected from any desired gene using, for example, CAT 
35 (chloramphenicol transferase) vectors and more preferably pKK232-8 and pCM7 vectors. 
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Preferred bacterial promoters are the Lad, LacZ, the T3 or 17 bacteriophage RNA 
polymerase promoters, the gpt, lambda PR, PL and trp promoters (EP 0036776), the polyhedrin 
promoter, or the'plO protein promoter from baculovims (KitNovagen) (Smith et al., 1983; 
O'Reilly et al 1992), the lambda PR promoter or also the trc promoter 
5 Eukaryotic promoters include CMV immediate early, HSV thymidine kinase, early and 

late SV40, LTRs from retrovirus, and mouse metallothionein-L. Selection of a convenient 
vector and promoter is well within the level of ordinary skill in the art. 

The choice of a promoter is well within the ability of a person skilled in the field of 
genetic engineering. For example, one may refer to the book of Sambrook et al.(1989) or also 
10 to the procedures described by Fuller et al.(1996). 

Other regulatory elements 

One will typically desire to include a polyadenylation signal to effect proper 
poiyadenylation of the gene transcript. The nature of the polyadenylation signal is not believed 
to be crucial to the successful practice of the invention, and any such sequence may be 

15 employed such as human growth hormone and SV40 polyadenylation signals. Also 

contemplated as an element of the expression cassette is a terminator. These elements can serve 
to enhance message levels and to minimize read through from the cassette into other sequences. 

The vector containing the appropriate DN A sequence as described above, more 
preferably a g35030 gene regulatory polynucleotide, a polynucleotide encoding a g35030 

20 polypeptide comprising at least one nucleotide sequence selected from the group of nucleotide 

sequence ranges consisting of: g35O30 genomic DNA or cDNAs comprised in the nucleic acids 
of any of nucleotide positions 201123 to 201234, 201 123 to 201 560, 214676 to 214793, 215702 
to 215746, 21 6836 to 216994, 216836 to 217077, 217671 to 217764, 227655 to 227736, 
238715 to 238919, 240440 to 240673, 240440 to 241 153, 241072 to 241291, 244353 to 

25 244561, 246273 to 247802, 201 123 to 247802, 199122 to 201 122, 247803 to 249803 and 

1 99122 to 249803 of SEQ ID No. 1 , SEQ ID Nos 2 to 17 and primate g3 5030 DNAs of SEQ ID 
Nos 79 to 1 32, and the complements thereof. 
3. Selectable Markers 

Such markers would confer an identifiable change to the cell permitting easy 
30 identification of cells containing the expression construct. The selectable marker genes for 

selection of transformed host cells are preferably dihydrofolate reductase or neomycin 
resistance for eukaryotic cell culture, TRP1 for S. cerevisiae or tetracycline, rifampicin or 
ampicillin resistance in E. coli, or levan saccharase for mycobacteria, this latter marker being a 
negative selection marker. 
35 4. Preferred Vectors. 
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Bacterial vectors 

As a representative but non-limiting example, useful expression vectors for bacterial 
use can comprise a selectable marker and a bacterial origin of replication derived from 
commercially available plasmids comprising genetic elements of pBR322 (ATCC 3701 7). Such 
5 commercial vectors include, for example, pKK223-3 (Pharmacia, Uppsala, Sweden), and 

GEM1 (Promega Biotec, Madison, WI, USA). 

Large numbers of other suitable vectors are known to those of skill in the art, and 
commercially available, such as the following bacterial vectors: pQE70, pQE60, pQE-9 
(Qiagen), pbs, pDIO, phagescript, psiX174, pbluescript SK, pbsks, pNH8A, pNH16A s 
1 0 pNHl 8A, pNH46A (Stratagene); ptrc99a, pKK223-3, pKK233-3, pDR540, pRIT5 

(Pharmacia); pWLNEO, PSV2CAT, pOG44, pXTl, pSG (Stratagene); pSVK3, pBPV, pMSG, 
pSVL (Pharmacia); pQE-30 (QIAexpress). 

Bacteriophage vectors 

The PI bacteriophage vector may contain large inserts ranging from about 80 to about 

15 100 kb. 

The construction of PI bacteriophage vectors such as pl58 or pl58/neo& are notably 
described by Sternberg (1992, 1994). Recombinant PI clones comprising g35030 
polynucleotide sequences may be designed for inserting large polynucleotides of more than 40 
kb (Linton et ah, 1993). To generate PI DNA for transgenic experiments, a preferred protocol 

20 is the protocol described by McCormick et al.( 1 994). Briefly, E. coli (preferably strain 

NS3529) harboring the PI plasmid are grown overnight in a suitable broth medium containing 
25 tig/ml of kanamycin. The PI DMA is prepared from the R coli by alkaline lysis using the 
Qiagen Plasmid Maxi kit (Qiagen, Chatsworth, CA, USA), according to the manufacturer's 
instructions. The PI DNA is purified from the bacterial lysate on two Qiagen-tip 500 columns, 

25 using the washing and elution buffers contained in the kit, A phenol/chloroform extraction is 

then performed before precipitating the DNA with 70% ethanol. After solubilizing the DNA in 
TE (10 mM Tris-HCl, pH 7.4, 1 mM EDTA), the concentration of the DNA is assessed by 
spectrophotometry. 

When the goal is to express a PI clone comprising a g3503O polynucleotide sequence in 
30 a transgenic animal, typically in transgenic mice, it is desirable to remove vector sequences 

from the PI DNA fragment, for example by cleaving the PI DNA at rare-cutting sites within die 
P 1 polylinker (Sfil, Notl or Sail). The PI insert is then purified from vector sequences on a 
pulsed-field agarose gel, using methods similar using methods similar to those originally 
reported for the isolation of DNA from YACs (Schedi et al., 1993a; Peterson et al. 9 1993, ). At 
35 this stage, the resulting purified insert DNA can be concentrated, if necessary, on a Millipore 
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Ultrafree-MC Filter Unit (Millipore, Bedford, MA, USA - 30,000 molecular weight limit) and 
then dialyzed against microinjection buffer (10 mM Tris-HCI, pH 7.4; 250 uM EDTA) 
containing 100-mM NaCI, 30 uM spermine, 70 uM spermidine on a microdyalisis membrane 
(type VS, 0.025 uM from Millipore). The intactness of the purified PI DNA insert is assessed 
5 by electrophoresis on 1% agarose (Sea Kern GTG; FMC Bio-products) pulse-field gel and 

staining with ethidium bromide. 
Baculovirus vectors 

A suitable vector for the expression of a g35030 polypeptide encoded by a 
polynucleotide of SEQ ID Nos. 1 to 1 7 or fragments or variants thereof is a baculovirus vector 
10 that can be propagated in insect cells and in insect cell lines. A specific suitable host vector 

system is the pVLl 392/1 393 baculovirus transfer vector (Pharmingen) that is used to transfect 
the SF9 eel! line (ATCC N°CRL 1711) which is derived from Spodoptera frugiperda. 

Other suitable vectors for the expression of the g3503O polypeptide encoded by the 
polynucleotides of SEQ ID Nos. 1 to 17, or fragments or variants thereof in a baculovirus 
1 5 expression system include those described by Chat et al.(l 993), Vlasak et al.( 1 983) and Lenhard 

etal.(1996). 

Viral vectors 

In one specific embodiment, the vector is derived from an adenovirus. Preferred 
adenovirus vectors according to the invention are those described by Feldman and Steg (1996) 
20 or Ohno et al.(1994). Another preferred recombinant adenovirus according to this specific 

embodiment of the present invention is the human adenovirus type 2 or 5 (Ad 2 or Ad 5) or an 
adenovirus of animal origin (French patent application N° FR-93.05954). 

Retrovirus vectors and adeno-associated virus vectors are generally understood to be 
the recombinant gene delivery systems of choice.for the transfer of exogenous polynucleotides 
25 in vivo, particularly to mammals, including humans. These vectors provide efficient delivery of 

genes into cells, and the transferred nucleic acids are stably integrated into the chromosomal 
DNA of the host. 

Particularly preferred retroviruses for the preparation or construction of retroviral in 
vitro or in vitro gene delivery vehicles of the present invention include retroviruses selected 

30 from the group consisting of Mink-Cell Focus Inducing Virus, Murine Sarcoma Virus, 

Reticuloendotheliosis virus and Rous Sarcoma virus. Particularly preferred Murine Leukemia 
Viruses include the 4070 A and the 1504A viruses, Abelson (ATCC No VR-999), Friend 
(ATCC No VR-245), Gross (ATCC No VR-590), Rauscher (ATCC No VR-998) and Moloney 
Murine Leukemia Virus (ATCC No VR-190; PCT Application No WO 94/24298). Particularly 

35 preferred Rous Sarcoma Viruses include Bryan high titer (ATCC Nos VR-334, VR-657, VR- 
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726, VR-659 and VR-728). Other preferred retroviral vectors are those described in Roth et 
al(1996), PCT Application No WO 93/25234, PCT Application No WO 94/ 0692O, Roux et al., 
1 989, Julan et ah, 1 992 and Neda et al., 1991 . 

Yet another viral vector system that is contemplated by the invention comprises the 
5 adeno-associated virus (AAV). The adeno-associated virus is a naturally occurring defective 

virus that requires another virus, such as an adenovirus or a herpes virus, as a helper virus for 
efficient replication and a productive life cycle (Muzyczka et al., 1992). It is also one of the 
few viruses that may integrate its DNA into non-dividing cells, and exhibits a high frequency of 
stable integration (Flotte et al., 1992; Samulski et al., 1989; McLaughlin et al, 1989). One 
10 advantageous feature of AAV derives from its reduced efficacy for transducing primary ceils 

relative to transformed cells. 

BAC vectors 

The bacterial artificial chromosome (BAC) cloning system (Shizuyaet al., 1992) has 
been developed to stably maintain large fragments of genomic DNA (100-300 kb) in E. coli. A 

1 5 preferred BAC vector comprises pBeloB AC 1 1 vector that has been described by Kim et 

aL(1996). BAC libraries are prepared with this vector using size-selected genomic DNA that 
has been partially digested using enzymes that permit ligation into either the Bam HI or HindUl 
sites in the vector. Flanking these cloning sites are T7 and SP6 RNA polymerase transcription 
initiation sites that can be used to generate end probes by either RNA transcription or PCR 

20 methods. After the construction of a BAC library in £. coli, BAC DNA is purified from the 

host cell as a supercoiled circle. Converting these circular molecules into a linear form precedes 
both size determination and introduction of the BACs into recipient cells. The cloning site is 
flanked by two Not I sites, permitting cloned segments to be excised from the vector by Not I 
digestion. Alternatively, the DNA insert contained in the pBeloBACl 1 vector may be 

25 linearized by treatment of the BAC vector with the commercially available enzyme lambda 

terminase that leads to the cleavage at the unique cosN site, but this cleavage method results in a 
full length BAC clone containing both the insert DNA and the BAC sequences. 
5. Delivery Of The Recombinant Vectors 

In order to effect expression of the polynucleotides and polynucleotide constructs of the 
30 invention, these constructs must be delivered into a cell. This delivery may be accomplished in 

vitro, as in laboratory procedures for transforming cell lines, or in vivo or ex vivo, as in the 
treatment of certain diseases states. 

One mechanism is viral infection where the expression construct is encapsulated in an 
infectious viral particle, 

35 Several non-viral methods for the transfer of polynucleotides into cultured mammalian 
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cells are also contemplated by the present invention, and include, without being limited to, 
calcium phosphate precipitation (Graham et al y 1 973; Chen et ah, 1987), DEAE-dextran 
(Gopal, 1985), electroporation (Tur-Kaspa et aL, 1986; Potter et al., 1984), direct 
microinjection (Harland et al., 1985), DNA-loaded liposomes (Nicolau et al., 1982; Fraley et 
5 al., 1979), and receptor-mediate transfection (Wu and Wu, 1987; 1988). Some of these 

techniques may be successfully adapted for in vivo or ex vivo use. 

Once the expression polynucleotide has been delivered into the cell, it may be stably 
integrated into the genome of the recipient cell. This integration may be in the cognate location 
and orientation via homologous recombination (gene replacement) or it may be integrated in a 
10 random, non specific location (gene augmentation). In yet further embodiments, the nucleic 

acid may be stably maintained in the cell as a separate, episomal segment of DNA. Such 
nucleic acid segments or "episomes" encode sequences sufficient to permit maintenance and 
replication independent of or in synchronization with the host cell cycle. 

One specific embodiment for a method for delivering a protein or peptide to the interior 
15 of a cell of a vertebrate in vivo comprises the step of introducing a preparation comprising a 

physiologically acceptable carrier and a naked polynucleotide operatively coding for the 
polypeptide of interest into the interstitial space of a tissue comprising the cell, whereby the 
naked polynucleotide is taken up into the interior of the cell and has a physiological effect. This 
is particularly applicable for transfer in vitro but it may be applied to in vivo as well. 
20 Compositions for use in vitro and in vivo comprising a "naked" polynucleotide are 

described in PCT application N° WO 90/1 1092 (Vical Inc.) and also in PCT application No. 
WO 95/1 1307 (Institut Pasteur, INSERM, Universite d'Ottawa) as well as in the articles of 
Tacson et al.(1996) and of Huygen et al.(1996). 

In still another embodiment of the invention, the transfer of a naked polynucleotide of 
25 the invention, including a polynucleotide construct of the invention, into cells may be proceeded 

with a particle bombardment (biolistic), said particles being DNA-coated microprojectiles 
accelerated to a high velocity allowing them to pierce cell membranes and enter cells without 
killing them, such as described by Klein et al.(l 987). 

In a further embodiment, the polynucleotide of the invention may be entrapped in a 
30 liposome (Ghosh and Bacchawat, 1991; Wong et ah, 1980; Nicolau et al., 1987). 

In a specific embodiment, the invention provides a composition for the in vivo 
production of the g35030 protein or polypeptide described herein. It comprises a naked 
polynucleotide operatively coding for this polypeptide, in solution in a physiologically 
acceptable carrier, and suitable for introduction into a tissue to cause cells of the tissue to 
35 express the said protein or polypeptide. 
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The amount of vector to be injected to the desired host organism varies according to the 
site of injection. As an indicative dose, it will be injected between 0>1 and 100 ug of the vector 
in an animal body, preferably a mammal body, for example a mouse body. 

In another embodiment of the vector according to the invention, it may be introduced in 
5 vitro in a host cell, preferably in a host cell previously harvested from the animal to be treated 

and more preferably a somatic cell such as a muscle cell. In a subsequent step, the cell that has 
been transformed with the vector coding for the desired g35030 polypeptide or the desired 
fragment thereof is reintroduced into the animal body in order to deliver the recombinant 
protein within the body either locally or systemically. 
10 Cell Hosts 

Another object of the invention comprises a host cell that have been transformed or 
transfected with one of the polynucleotides described herein, and more precisely a 
polynucleotide comprising a g35030 polynucleotide selected from the group consisting of SEQ 
ID Nos. 1 to 1 7 and 79 to 132, or a fragment or a variant thereof. Are included host cells that 
15 are transformed (prokaryotic cells) or that are transfected (eukaryotic cells) with a recombinant 

vector such as one of those described above. 

Generally, a recombinant host cell of the invention comprises any one of the 
polynucleotides or the recombinant vectors described therein. 

Preferred host cells used as recipients for the expression vectors of the invention are the 
20 following: 

a) Prokaryotic host cells: Escherichia coli strains (I.E.DHS-ct strain), Bacillw subtilis, 
Salmonella typhimvrium> and strains from species like Pseudomonas^ Streptomyces and 
Staphylococcus* 

b) Eukaryotic host cells: HeLa cells (ATCC N°CCL2; N°CCL2.l; M°CCL2.2), Cv 1 
25 cells (ATCC N°CCL70), COS cells (ATCC N°CRL1 650; N°CRL1651), Sf-9 cells (ATCC 

N°CRL171 1), C127 cells (ATCC N° CRL-1 804), 3T3 (ATCC N° CRL-6361), CHO (ATCC N° 
CCL-61), human kidney 293. (ATCC N° 45504; N° CRL-1 573) and BHK (ECACC N° 
84100501; N° 841 11301). 

c) Other mammalian host cells. 

30 G35O30 gene expression in mammalian, and typically human, cells may be rendered 

defective with the replacement of a g35030 nucleic acid counterpart in the genome of an animal 
cell by a g35030 polynucleotide according to the invention. These genetic alterations may be 
generated by homologous recombination events using specific DNA constructs that have been 
previously described. 

35 One kind of cell hosts that may be used are mammal zygotes, such as murine zygotes. 
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For example, murine zygotes may undergo microinjection with a purified DNA molecule of 
interest, for example a purified DNA molecule that has previously been adjusted to a 
concentration range from 1 ng/mt -for BAC inserts- 3 ng/^I -for PI bacteriophage inserts- in 10 
mM Tris-HCl, pH 7.4, 250 pM EDTA containing 100 mM NaCl, 30 pM spermine, and70 pM 
5 spermidine. When the DNA to be microinjected has a large size, polyamines and high salt 

concentrations can be used in order to avoid mechanical breakage of this DNA, as described by 
Schedletal (1993 b). 

Any of the polynucleotides of the invention, including the DNA constructs described 
herein, may be introduced in an embryonic stem (ES) cell line, preferably a mouse ES cell line* 
10 ES cell lines are derived from pluripotent, uncommitted ceils of the inner cell mass of pre- 

implantation blastocysts. Preferred ES cell lines are the following: ES-E14TG2a (ATCC n° 
CRL-1821), ES-D3 (ATCC n° CRL1934 and n° CRL-1 1632), YS001 (ATCC n° CRL-1 1776), 
36.5 (ATCC n° CRL-1 1116). To maintain ES cells in an uncommitted state, they are cultured 
in the presence of growth inhibited feeder cells which provide the appropriate signals to 
15 preserve this embryonic phenotype and serve as a matrix for ES cell adherence. Preferred 

feeder cells are primary embryonic fibroblasts that are established from tissue of day 13- day 14 
embryos of virtually any mouse strain, that are maintained in culture, such as described by 
Abbondanzo et al.(1993) and are inhibited in growth by irradiation, such as described by 
Robertson (1987), or by the presence of an inhibitory concentration of LIF, such as described by 
20 Pease and Williams (1990). 

The constructs in the host cells can be used in a conventional manner to produce the 
gene product encoded by the recombinant sequence. 

Following transformation of a suitable host and growth of the host to an appropriate cell 
density, the selected promoter is induced by appropriate means, such as temperature shift or 
25 chemical induction, and cells are cultivated for an additional period. 

Cells are typically harvested by centrifugation, disrupted by physical or chemical 
means, and the resulting crude extract retained for further purification. 

Microbial cells employed in the expression of proteins can be disrupted by any 
convenient method, including freeze-thaw cycling, sonication, mechanical disruption, or use of 
30 cell lysing agents. Such methods are well known by the skill artisan. 

Transgenic Animals 

The terms "transgenic animals" or "host animals" are used herein designate animals that 
have their genome genetically and artificially manipulated so as to include one of the nucleic 
acids according to the invention. Preferred animals are non-human mammals and include those 
35 belonging to a genus selected from Mus (e.g. mice), Rattus (e.g. rats) and Oryctogalus (e.g. 
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rabbits) which have their genome artificially and genetically altered by the insertion of a 
nucleic acid according to the invention. In one embodiment, the invention encompasses non- 
human host mammals and animals comprising a recombinant vector of the invention or a 
g35030 gene disrupted by homologous recombination with a knock out vector. The invention 
5 also encompasses non-human primates comprising a recombinant vector of the invention or a 

g35030 gene disrupted by homologous recombination with a knock out vector. 

The transgenic animals of the invention all include within a plurality of their cells a 
cloned recombinant or synthetic DNA sequence, more specifically one of the purified or 
isolated nucleic acids comprising a g35030 polynucleotide or a DNA sequence encoding an 
1 0 antisense polynucleotide such as described in the present specification. 

Generally, a transgenic animal according the present invention comprises any one of the 
polynucleotides, the recombinant vectors and the cell hosts described in the present invention. 

In a first preferred embodiment, these transgenic animals may be good experimental 
models in order to study the diverse pathologies related to cell differentiation, in particular 
1 5 concerning the transgenic animals within the genome of which has been inserted one or several 

copies of a polynucleotide encoding a native g35O30 protein, or alternatively a mutant g35030 
protein. 

In a second preferred embodiment, these transgenic animals may express a desired 
polypeptide of interest under the control of regulatory polynucleotides which lead to good 
20 yields in the synthesis of this protein of interest; and optionally a tissue specific expression of 

this protein of interest 

The design of the transgenic animals of the invention may be made according to the 
conventional techniques well known from the one skilled in the art. For more details regarding 
the production of transgenic animals, and specifically transgenic mice, it may be referred to US 
25 Patents Nos 4,873,191, issued Oct. 10, 1989; 5,464,764 issued Nov 7, 1995; and 5,789,21 5, 

issued Aug 4, 1998. 

Transgenic animals of the present invention are produced by the application of 
procedures which result in an animal with a genome that has incorporated exogenous genetic 
material. The procedure involves obtaining the genetic material, or a portion thereof, which 
30 encodes either a g35O30 polynucleotide or antisense polynucleotide such as described in the 

present specification. 

A recombinant polynucleotide of the invention is inserted into an embryonic or ES stem 
cell line. The insertion is preferably made using electroporation, such as described by Thomas 
et al.(1987). The cells subjected to electroporation are screened (e.g. by selection via selectable 
35 markers, by PCR or by Southern blot analysis) to find positive cells which have integrated the 
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exogenous recombinant polynucleotide into their genome, preferably via an homologous 
recombination event. An illustrative positive-negative selection procedure that may be used 
according to the invention is described by Mansour et aL( 1988). 

Then, the positive cells are isolated, cloned and injected into 3.5 days old blastocysts 
5 from mice, such as described by Bradley (1987). The blastocysts are then inserted into a female 

host animal and allowed to grow to term. 

Alternatively, the positive ES cells are brought into contact with embryos at the 2.5 
days old 8-16 cell stage (morulae) such as described by Wood et al,(1993) or by Nagy et 
al.(1993), the ES cells being internalized to colonize extensively the blastocyst including the 
1 0 cells which will give rise to the germ tine. 

The offspring of the female host are tested to determine which animals are transgenic 
e.g. include the inserted exogenous DNA sequence and which are wild-type. 

Thus, the present invention also concerns a transgenic animal containing a nucleic acid, 
a recombinant expression vector or a recombinant host cell according to the invention. 
15 Recombinant Cell Lines Derived From The Transgenic Animals Of The Invention. 

A further object of the invention comprises recombinant host cells obtained from a 
transgenic animal described herein. In one embodiment the invention encompasses cells 
derived from non-human host mammals and animals comprising a recombinant vector of the 
invention or a gene comprising a g35O30 nucleic acid sequence disrupted by homologous 
20 recombination with a knock out vector. 

Recombinant cell lines may be established in vitro from cells obtained from any tissue 
of a transgenic animal according to the invention, for example by transfection of primary cell 
cultures with vectors expressing owc-genes such as SV40 large T antigen, as described by Chou 
(1989) and Shay etal.(1991). 

25 

Assays For Identification Of Compounds For Treatment Of Schizophrenia And 

Bipolar Disorder 

The present invention provides assays which may be used to test compounds for their 
ability to treat CNS disorders, and in particular, to ameliorate symptoms of a CNS disorder 

30 mediated by g35O30. In preferred embodiments, compounds tested for their ability to 

ameliorate syptoms of schizophrenia or bipolar disorder mediated by g35030. Compounds may 
also be tested for their ability to treat related disorders, including among others psychotic 
disorders, mood disorders, autism, substance dependence and alcoholism, mental retardation, and 
other psychiatric diseases including cognitive, anxiety, eating, impulse-control, and personality 

35 disorders, as defined with the Diagnosis and Statistical Manual of Mental Disorders fourth edition 
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(DSM-IV) classification. 

The present invention also provides cell and animal, including primate and mouse, models 
of schizophrenia, bipolar disorder and related disorders. 

In one aspect, provided are non-cell based, cell based and animal based assays for the 
5 identification of such compounds that affect g35030 activity. g35O30 activity may be affected 

by any mechanism; in certain embodiments, g35030 activity is affected by modulating g35030 
gene expression or the activity of the g3 5030 gene product. 

The present methods allow the identification of compounds that affect g35030 activity 
directly or indirectly. Thus, the non-cell based, cell based and animal assays of the present 

10 invention may also be used to identify compounds that act on an element of a g3503O pathway 

other tha g3 5030 itself These compounds can then be used as a therapeutic treatment to 
modulate g35030 and other gene products involved in schizophrenia, bipolar disorder and 
related disorders. 

Cell and non-cell based assays 

15 In one aspect, cell based assays using recombinant or non-recombinant cells may be 

used to identify compounds which modulate g35030 activity. 

In one aspect, a cell based assay of the invention encompasses a method for identifying 
a test compound for the treatment of schizophrenia or bipolar disorder comprising (a) exposing 
a cell to a test compound at a concentration and time sufficient to ameliorate an endpoint related 

20 to schizophrenia or bipolar disorder, and (b) determining the level of g35O30 activity in a cell. 

g35O30 activity can be measured, for example, by assaying a cell for mRNA transcript level, 
g35030 peptide expression, localization or protein activity. Preferably the test compound is a 
compound capable of or suspected to be capable of ameliorating a symptom of schizophrenia, 
bipolar disorder or a related disorder. Test compounds capable of modulating g35030 activity 

25 may be selected for use in developing medicaments. Such cell based assays are further 

described herein in the section titled "Method For Screening Ligands That Modulate The 
Expression Of The g35030 Gene." 

In another aspect, a cell based assay of the invention encompasses a method for 
identifying a compound for the treatment of schizophrenia or bipolar disorder comprising (a) 

30 exposing a cell to a level of g35O30 activity sufficient to cause a schizophrenia-related or 

bipolar disorder-related endpoint, and (b) exposing said cell to a test compound. A test 
compound can then be selected according to its ability to ameliorate said schizophrenia-related 
or bipolar disorder-related endpoints. g35030 activity may be provided by any suitable method, 
including but not limited to providing a vector containing a g35030 nucleotide sequence, 

35 treating said cell with a compound capable of increasing g3503O expression, and treating said 
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cell with a g35030 peptide. Preferably said cell is treated with a g35030 peptide comprising a 
contiguous span of at least 4 amino acids of SEQ ID Nos. 1 8 to 23. Preferably the test 
compound is a compound capable of or suspected to be capable of ameliorating a symptom of 
schizophrenia, bipolar disorder or a related disorder; alternatively, the test compound is 

5 suspected of exacerbating an endpoint schizophrenia, bipolar disorder or a related disorder. A 

test compound capable of ameliorating any detectable symptom or endpoint of a schizophrenia, 
bipolar disorder or a related disorder may be selected for use in developing medicaments. 

In another embodiment, the invention provides cell and non-cell based assays to g3503O 
to determine whether g3 5030 peptides bind to the cell surface, and to identify compounds for 
. 10 the treatment of schizophrenia, bipolar disorder and related disorders that interact with a g35030 

receptor. In one such embodiment, a g3 5030 polynucleotide, or fragments thereof, is cloned 
into expression vectors such as those described herein. The proteins are purified by size, 
charge, immunochromatography or other techniques familiar to those skilled in the art. 
Following purification, the proteins are labeled using techniques known to those skilled in the 

15 art. The labeled proteins are incubated with cells or cell lines derived from a variety Of organs 

or tissues to allow the proteins to bind to any receptor present on the cell surface. Following the 
incubation, the cells are washed to remove non-specifically bound protein. The labeled proteins 
are detected by autoradiography. Alternatively, unlabeled proteins may be incubated with the 
cells and detected with antibodies having a detectable label, such as a fluorescent molecule, 

20 attached thereto. Specificity of ceil surface binding may be analyzed by conducting a 

competition analysis in which various amounts of unlabeled protein are incubated along with 
the labeled protein. The amount of labeled protein bound to the cell surface decreases as the 
amount of competitive unlabeled protein increases. As a control, various amounts of an 
unlabeled protein unrelated to the labeled protein is included in some binding reactions. The 

25 amount of labeled protein bound to the cell surface does not decrease in binding reactions 

containing increasing amounts of unrelated unlabeled protein, indicating that the protein 
encoded by the nucleic acid binds specifically to the cell surface. 

In another embodiment, the present invention comprises non-cell based binding assays, 
wherein a g35030 polypeptide is prepared and purified as in cell based binding assays described 

30 above. Following purification, the proteins are labeled and incubated with a cell membrane 

extract or isolate derived from any desired cells from any organs, tissue or combination of 
organs or tissues of interest to allow the §35030 polypeptide to bind to any receptor present on a 
membrane. Following the incubation, the membranes are washed to remove non-specifically 
bound protein. The labeled proteins may be detected by autoradiography. Specificity of 

35 membrane binding of g3503O may be analyzed by conducting a competition analysis in which 
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various amounts of a test compound are incubated along with the labeled protein. Any desired 
test compound, including test polypeptides, can be incubated with the cells. The test 
compounds may be detected with antibodies having a detectable label, such as a fluorescent 
molecule, attached thereto. The amount of labeled g35030 polypeptide bound to the cell surface 
5 decreases as the amount of competitive test compound increases. As a control, various amounts 

of an unlabeled protein or a compound unrelated to the test compound is included in some 
binding reactions. Test compounds capable of reducing the amount of g35030 bound to cell 
membranes may be selected as a candidate therapeutic compound. 

Said ceil based assays may comprise cells of any suitable origin; particularly preferred 
10 cells are human cells, primate cells, non-human primate cells and mouse cells. If non^human 

primate cells are used, the g3 5 030 may comprise a nucleotide sequence or be encoded by a 
nucleotide sequence according to the primate nucleic acid sequences of SEQ ID No. 79 to 132, 
or a sequence complementary thereto or a fragment thereof. 
Animal model based assay 
1 5 Non-human animal based assays may also be used to identify compounds which 

modulate g35030 activity. The invention encompasses animal models and animal based assays, 
including non-transgenic or transgenic animals, including animals containing a human or altered 
form of the g35030 gene. 

Thus, the present invention comprises treating an animal affected by schizophrenia or 
20 bipolar disorder or symptoms thereof with a test compound capable of directly or indirectly 

modulating g35030 activity. 

In one aspect, an animal based assay of the invention encompasses a method for 
identifying a test compound for the treatment of schizophrenia or bipolar disorder comprising 
(a) exposing an animal to a test compound at a concentration and time sufficient to ameliorate 
25 an endpoint related to schizophrenia or bipolar disorder, and (b) determining the level of 

g35030 activity at a site in said animal. g35030 activity can be measured in any suitable cell, 
tissue or site. Preferably the test compound is a compound capable of or suspected to be 
capable of ameliorating a symptom of schizophrenia, bipolar disorder or a related disorder. 
Optionally said test compound is capable or suspected to be capable of modulating g35030 
30 activity. Test compounds capable of modulating g35030 activity may be selected for use in 

developing medicaments. 

In another aspect, an animal based assay of the invention encompasses a method for 
identifying a compound for the treatment of schizophrenia or bipolar disorder comprising (a) 
exposing an animal to a level of g35030 activity sufficient to cause a schizophrenia-related or 
35 bipolar disorder-related symptom or endpoint, and (b) exposing said animal to a test compound. 
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A test compound can then be selected according to its ability to ameliorate said schizophrenia- 
related or bipolar disorder-related endpoints. g35030 activity may be provided by any suitable 
method, including but not limited to providing a vector containing a §35030 nucleotide 
sequence, treating said animal with a compound capable of increasing g35030 expression and 
5 treating said cell with a g35030 peptide. Preferably, said animal is treated with a g35030 

peptide comprising a contiguous span of at least 4 amino acids of SEQ ID Nos. 1 8 to 23 . 
Preferably the test compound is a compound capable of or suspected to be capable of 
ameliorating a symptom of schizophrenia, bipolar disorder or a related disorder; alternatively, 
. the test compound is suspected of exacerbating a symptom of schizophrenia, bipolar disorder or 

ID a related disorder. A test compound capable of ameliorating any detectable symptom or 

endpoint of a schizophrenia, bipolar disorder or a related disorder may be selected for use in 
developing medicaments. 

Any suitable animal may be used. Preferably, said animal is a primate, a non-human 
primate, a mammal, or a mouse. 

15 In one embodiment, a mouse is treated with a g35030 peptide, exposed to a test 

compound, and symptoms indicative of schizophrenia, bipolar disorder or a related disorder are 
assessed by observing stereotypy. In other embodiments, said symptoms are assessed by 
performing at least one test from the group consisting of home cage observation, neurological 
evaluation, stress-induced hypothermia, forced swim, PTZ seizure, locomotor activity, tail 

20 suspension, elevated plus maze, novel object recognition, prepulse inhibition, thermal pain, Y- 

maze, and metabolic chamber tests (Psychoscreen™ tests available from Psychogenics Inc.). 
Other tests are known in Crawley et al, Horm. Behav. 31(3):197-21 1 (1997); Crawley, Brain 
Res 835(i):18-26 (1999) for example. 

Any suitable test compound may be used with the screening methods of the invention. 

25 Examples of compounds that may be screened by the methods of the present invention include 

small organic or inorganic molecules, nucleic acids, including polynucleotides from random and 
directed polynucleotide libraries, peptides, including peptides derived from random and directed 
peptide libraries, soluble peptides, fusion peptides, and phosphopeptides, antibodies including 
polyclonal, monoclonal, chimeric, humanized, and anti-idiotypic antibodies, and single chain 

30 antibodies, FAb, F(ab')2 and FAb expression library fragments, and epitope-binding fragments 

thereof. In certain aspects, a compound capable of ameliorating or exacerbating a symptom or 
endpoint of schizophrenia, bipolar disorder or a related disorder may include, by way of 
example, antipsychotic drugs in general, neuroleptics, atypical neuroleptics, antidepressants, 
anti-anxiety drugs, noradrenergic agonists and antagonists, dopaminergic agonists and 

35 antagonists, serotonin reuptake inhibitors, benzodiazepines. 



WO 01/40493 PCT/IB00/O15O7 

125 

Methods for screening substances interacting with a g35030 polypeptides 

For the purpose of the present invention, a ligand means a molecule, such as a protein, a 
peptide, an antibody or any synthetic chemical compound capable of binding to the g35030 
protein or one of its fragments or variants or to modulate the expression of the polynucleotide 

5 coding for the g35030 or a fragment or variant thereof. 

In the ligand screening method according to the present invention, a biological sample 
or a defined molecule to be tested as a putative ligand of the g35030 protein is brought into 
contact with the corresponding purified g35030 protein, for example the corresponding purified 
recombinant g35030 protein produced by a recombinant cell host as described hereinbefore, in 

1 0 order to form a complex between this protein and the putative 1 igand molecule to be tested. 

As an illustrative example, to study the interaction of the g35030 protein, or a fragment 
comprising a contiguous span of at least 4 amino acids, preferably at least 6, or preferably at 
least 8 to 1 0 amino acids, more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino acids 
of SEQ ID Nos IS to 23, with drugs or small molecules, such as molecules generated through 

15 combinatorial chemistry approaches, the microdialysis coupled to HPLC method described by 

Wang et al. (1 997) or the affinity capillary electrophoresis method described by Bush et al. 
(1997), can be used. 

In further methods, peptides, drugs, fatty acids, lipoproteins, or small molecules which 
interact with the g35030 protein, or a fragment comprising a contiguous span of at least 4 amino 
20 acids, preferably at least 6, or preferably at least 8 to 1 0 amino acids, more preferably at least 

12, 15, 20, 25, 30, 40, 50, or 1 00 amino acids of SEQ ID Nos 1 8 to 23, may be identified using 
assays such as the following. The molecule to be tested for binding is labeled with a detectable 
label, such as a fluorescent , radioactive, or enzymatic tag and placed in contact with 
immobilized g35030 protein, or a fragment thereof under conditions which permit specific 
25 binding to occur. After removal of non-specifically bound molecules, bound molecules are 

detected using appropriate means. 

Another object of the present invention comprises methods and kits for the screening of 
candidate substances that interact with a g3503O polypeptide. 

The present invention pertains to methods for screening substances of interest that 
30 interact with a g35030 protein or one fragment or variant thereof. By their capacity to bind 

covalently or non-covalently to a g35030 protein or to a fragment or variant thereof, these 
substances or molecules may be advantageously used both in vitro and in vivo. 

In vitro,, said interacting molecules may be used as detection means in order to identify 
the presence of a g35030 protein in a sample, preferably a biological sample. 
35 A method for the screening of a candidate substance comprises the following steps : 
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a) providing a polypeptide comprising, consisting essentially of, or consisting of a 
g35030 protein or a fragment comprising a contiguous span of at least 4 amino acids, preferably 
at least 6 amino acids, more preferably at least 8 to 10 amino acids, more preferably at least 12, 
1 5, 20, 25, 30, 40, 50, or 1 00 amino acids of SEQ ID Nos. 1 8 to 23; 
5 b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; and 

d) detecting the complexes formed between said polypeptide and said candidate 
substance. 

The invention further concerns a kit for the screening of a candidate substance 
10 interacting with the g3503O polypeptide, wherein said kit comprises: 

a) a g35030 protein having an amino acid sequence selected from the group consisting 
of the amino acid sequences of SEQ ID Nos. 1 8 to 23 or a peptide fragment comprising a 
contiguous span of at least 4 amino acids, preferably at least 6 amino acids, more preferably at 
least 8 to 10 amino acids, and more preferably at least 12, 15, 20, 25, 30, 40, 50, or 100 amino 

15 acids of SEQ ID Nos. 18 to 23; and 

b) optionally means useful to detect the complex formed between the g35O30 protein or 
a peptide fragment or a variant thereof and the candidate substance. 

In a preferred embodiment of the kit described above, the detection means comprise 
monoclonal or polyclonal antibodies directed against the g35030 protein or a peptide fragment 

20 or a variant thereof. 

Various candidate substances or molecules can be assayed for interaction with a g35030 
polypeptide. These substances or molecules include, without being limited to, natural or 
synthetic organic compounds or molecules of biological origin such as polypeptides. When the 
candidate substance or molecule comprise a polypeptide, this polypeptide may be the resulting 

25 expression product of a phage clone belonging to a phage-based random peptide library, or 

alternatively the polypeptide may be the resulting expression product of a cDNA library cloned 
in a vector suitable for performing a two-hybrid screening assay. 

The invention also pertains to kits useful for performing the hereinbefore described 
screening method. Preferably, such kits comprise a g35030 polypeptide or a fragment or a 

3 0 variant thereof, and optionally means useful to detect the complex formed between the g3 5 03 0 

polypeptide or its fragment or variant and the candidate substance. In a preferred embodiment 
the detection means comprise monoclonal or polyclonal antibodies directed against the 
corresponding g35030 polypeptide or a fragment or a variant thereof. 



WO 01/40493 PCT/DB00/01507 

127 

A. Candidate ligands obtained from random peptide libraries 

In a particular embodiment of the screening method, the putative ligand is the 
expression product of a DNA insert contained in a phage vector (Parmley and Smith, 1 988). 
Specifically, random peptide phages libraries are used. The random DNA inserts encode for 
5 peptides of 8 to 20 amino acids in length (Oldenburg K.R. et aL, 1 992; Valadon P., et al., 1 996; 

Lucas A.H., 1994; Westerink M.A J., 1995; Felici F. et al., 1991). According to this particular 
embodiment, the recombinant phages expressing a protein that binds to the immobilized g35030 
protein is retained and the complex formed between the g35030 protein and the recombinant 
phage may be subsequently immunoprecipitated by a polyclonal or a monoclonal antibody 

10 directed against the g3 5030 protein. 

Once the ligand library in recombinant phages has been constructed, the phage 
population is brought into contact with the immobilized g35030 protein. Then the preparation 
of complexes is washed in order to remove the non-spec ifically bound recombinant phages. 
The phages that bind specifically to the g35030 protein are then eluted by a buffer (acid pH) or 

1 5 immunoprecipitated by the monoclonal antibody produced by the hybridoma anti- g35030, and 

this phage population is subsequently amplified by an over-infection of bacteria (for example E. 
coli). The selection step may be repeated several times, preferably 2-4 times, in order to select 
the more specific recombinant phage clones. The last step comprises characterizing the peptide 
produced by the selected recombinant phage clones either by expression in infected bacteria and 

20 isolation, expressing the phage insert in another host-vector system, or sequencing the insert 

contained in the selected recombinant phages. 

B. Candidate ligands obtained by competition experiments. 

Alternatively, peptides, drugs or small molecules which bind to the g3503O protein, or a 
fragment comprising a contiguous span of at least 4 amino acids, preferably at least 6 amino 

25 acids, more preferably at least 8 to 10 amino acids, and more preferably at least 12, 1 5, 20, 25, 

30, 40, 50, or 100 amino acids of SEQ ID Nos. 18 to 23, may be identified in competition 
experiments. In such assays, the g35030 protein, or a fragment thereof, is immobilized to a 
surface, such as a plastic plate. Increasing amounts of the peptides, drugs or small molecules 
are placed in contact with the immobilized g35030 protein, or a fragment thereof, in the 

30 presence of a detectable labeled known g35030 protein ligand. For example, the g35030 ligand 

may be detectably labeled with a fluorescent, radioactive, or en2ymatic tag. The ability of the 
test molecule to bind the g35030 protein, or a fragment thereof, is determined by measuring the 
amount of detectably labeled known ligand bound in the presence of the test molecule. A 
decrease in the amount of known ligand bound to the g35030 protein, or a fragment thereof, 
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when the test molecule is present indicated that the test molecule is able to bind to the g35030 
protein, or a fragment thereof, 

C. Candidate ligands obtained by affinity chromatography- 

Proteins or other molecules interacting with the g35030 protein, or a fragment 
5 comprising a contiguous span of at 4 amino acids, preferably at least 6 amino acids, more 

preferably at least 8 to 1 0 amino acids, and more preferably at least 12, 15, 20, 25, 30, 40, 50, or 
100 amino acids of SEQ ID Nos 1 8 to 23,.can also be found using affinity columns which 
contain the g35030 protein, or a fragment thereof. The g35030 protein, or a fragment thereof, 
may be attached to the column using conventional techniques including chemical coupling to a 
10 suitable column matrix such as agarose, Affi Gel® , or other matrices familiar to those of skill 

in art. In some embodiments of this method, the affinity column contains chimeric proteins in 
which the g35030 protein, or a fragment thereof, is fused to glutathion S transferase (GST). A 
mixture of cellular proteins or pool of expressed proteins as described above is applied to the 
affinity column. Proteins or other molecules interacting with the g35030 protein, or a fragment 
1 5 thereof, attached to the column can then be isolated and analyzed on 2-D electrophoresis gel as 

described in Ramunsen et al. (1997). Alternatively, the proteins retained on the affinity column 
can be purified by electrophoresis based methods and sequenced. The same method can be used 
to isolate antibodies, to screen phage display products, or to screen phage display human 
antibodies. 

20 D. Candidate ligands obtained by optical biosensor methods 

Proteins interacting with the g35030 protein, or a fragment comprising a contiguous 
span of at least 4 amino acids, preferably at least 6 amino acids, more preferably at least 8 to 10 
amino acids, and more preferably at least 12, 1 5, 20, 25, 30, 40, 50, or 100 amino acids of SEQ 
ID Nos. 18 to 23, can also be screened by using an Optical Biosensor as described in Edwards 

25 and Leatherbarrow (1997) and also in Szabo et ai. (1995). This technique permits the detection 

of interactions between molecules in real time, without the need of labeled molecules. This 
technique is based on the surface plasmon resonance (SPR) phenomenon. Briefly, the candidate 
ligand molecule to be tested is attached to a surface (such as a carboxymethyl dextran matrix). 
A light beam is directed towards the side of the surface that does not contain the sample to be 

30 tested and is reflected by said surface. The SPR phenomenon causes a decrease in the intensity 

of the reflected light with a specific association of angle and wavelength. The binding of 
candidate ligand molecules cause a change in the refraction index on the surface, which change 
is detected as a change in the SPR signal. For screening of candidate ligand molecules or 
substances that are able to interact with the g35030 protein, or a fragment thereof, the g35030 
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protein, or a fragment thereof, is immobilized onto a surface. This surface comprises one side 
of a cell through which flows the candidate molecule to be assayed. The binding of the 
candidate molecule on the g35030 protein, or a fragment thereof, is detected as a change of the 
SPR signal. The candidate molecules tested may be proteins, peptides, carbohydrates, lipids, or 
5 small molecules generated by combinatorial chemistry. This technique may also be performed 

by immobilizing eukaryotic or prokaryotic cells or lipid vesicles exhibiting an endogenous or a 
recombinantly expressed g35030 protein at their surface. 

The main advantage of the method is that it allows the determination of the association 
rate between the g35030 protein and molecules interacting with the g35030 protein. It is thus 
1 0 possible to select specifically ligand molecules interacting with the g35030 protein, or a 

fragment thereof, through strong or conversely weak association constants. 

£. Candidate Iigands obtained through a two-hybrid screening assay. 

The yeast two-hybrid system is designed to study protein-protein interactions in vivo 
(Fields and Song, 1 989), and relies upon the fusion of a bait protein to the DNA binding domain 
15 of the yeast Gal4 protein. This technique is also described in the US Patent N° US 5,667,973 

and the US Patent N° 5,283,173 (Fields et aL). 

The general procedure of library screening by the two-hybrid assay may be performed 
as described by Harper et al. (1993) or as described by Cho et aL (1998) or also Fromont-Racine 
etal.(1997). 

20 The bait protein or polypeptide comprises, consists essentially of, or consists of a 

g35030 polypeptide or a fragment comprising a contiguous span of at least 4 amino acids, 
preferably at least 6 amino acids,~rnore preferably at least 8 to 1 0 amino acids, and more 
preferably at least 12, 15, 20, 25, 30, 40, 50, or 1 00 amino acids of SEQ ID Nos, 18 to 23. 

More precisely, the nucleotide sequence encoding the g35030 polypeptide or a fragment 

25 or variant thereof is fused to a polynucleotide encoding the DNA binding domain of the GAL4 

protein, the fused nucleotide sequence being inserted in a suitable expression vector, for 
example pAS2 or pM3 . 

Then, a human cDNA library is constructed in a specially designed vector, such that the 
human cDNA insert is fused to a nucleotide sequence in the vector that encodes the 

3D transcriptional domain of the GAL4 protein. Preferably, the vector used is the pACT vector. 

The polypeptides encoded by the nucleotide inserts of the human cDNA library are termed 
"pray' ■ polypeptides. 

A third vector contains a detectable marker gene, such as beta galactosidase gene or 
CAT gene that is placed under the control of a regulation sequence that is responsive to the 



WO 01/40493 PCT/IB00/01507 

130 

binding of a complete Gal4 protein containing both the transcriptional activation domain and 
the DNA binding domain. For example, the vector pG5EC may be used. 

Two different yeast strains are also used. As an illustrative but non limiting example 
the two different yeast strains may be the followings : 
5 - Y 1 90, the phenotype of which is (MATa, Leu2-3, 112 ura3-12, trpl-901, his3-D200 f ade2- 

101, gal4Dgall80D URA3 GAL-LacZ, LYS GAL-HIS3, cyhj, 
- Y 1 87, the phenotype of which is {MATagaU gal80 his3 trpl-901 ade2-101 ura3-52 leu2-3, 
-112 URA3 GAL-lacZmef), which is the opposite mating type of Y190. 

Briefly, 20 fig of pAS2/ g35030 and 20 ng of pACT-cDNA library are co-transformed 

10 into yeast strain Y190. The transformants are selected for growth on minimal media lacking 

histidine, leucine and tryptophan, but containing the histidine synthesis inhibitor 3-AT (50 
mM). Positive colonies are screened for beta galactosidase by filter lift assay. The double 
positive colonies (His + , beta-gat) are then grown on plates lacking histidine, leucine, but 
containing tryptophan and cycloheximide ( 1 0 rng/ml) to select for loss of pAS2/ g35030 

15 plasmids by retention of pACT-cDNA library plasmids. The resulting Y190 strains are mated 

with Y187 strains expressing g35030 or non-related control proteins; such as cyclophilin B, 
lamin, or SNF1, as Gal4 fusions as described by Harper et aL (1 993) and by Bram et al. (Bram 
RJ et al., 1993), and screened for beta galactosidase by filter lift assay. Yeast clones that are 
beta gal- after mating with the control Gal4 fusions are considered false positives. 

20 In another embodiment of the two-hybrid method according to the invention, interaction 

between the g35030 or a fragment or variant thereof with cellular proteins may be assessed 
using the Matchmaker Two Hybrid System 2 (Catalog No. Kl 604-1 , Clontech). As described in 
the manual accompanying the Matchmaker Two Hybrid System 2 (Catalog No. K1604-1, 
Clontech), nucleic acids encoding the g35030 protein or a portion thereof, are inserted into an 

25 expression vector such that they are in frame with DNA encoding the DNA binding domain of the 

yeast transcriptional activator GAL4. A desired cDNA, preferably human cDNA, is inserted into a 
second expression vector such that they are in frame with DNA encoding the activation domain of 
GALA The two expression plasmids are transformed into yeast and the yeast are plated on 
selection medium which selects for expression of selectable markers on each of the expression 

30 vectors as well as GAL4 dependent expression of the HIS3 gene. Transformants capable of 

growing on medium lacking histidine are screened for GAL4 dependent lacZ expression. Those 
cells which are positive in both the histidine selection and the IacZ assay contain interaction 
between g35030 and the protein or peptide encoded by the initially selected cDNA insert. 
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Method For Screening Substances Interacting With The Regulatory Sequences Of 
A g35030 Gene. 

The present invention also concerns a method for screening substances or molecules 
that are able to interact with the regulatory sequences of the g35030 gene, such as for example 

5 promoter or enhancer sequences. 

Nucleic acids encoding proteins which are able to interact with the regulatory sequences 
of the g35030 gene, more particularly a nucleotide sequence selected from the group consisting 
of the polynucleotides of the 5' and 3' regulatory region or a fragment or variant thereof, and 
preferably a variant comprising one of the biallelic markers of the invention, may be identified 
,10 by using a one-hybrid system, such as that described in the booklet enclosed in the Matchmaker 

One-Hybrid System kit from Clontech (Catalog Ref. n° K 1603-1). Briefly, the target nucleotide 
sequence is cloned upstream of a selectable reporter sequence and the resulting DNA construct 
is integrated in the yeast genome {Saccharomyces cerevisiae). The yeast cells containing the 
reporter sequence in their genome are then transformed with a library comprising fusion 

1 5 molecules between cDNAs encoding candidate proteins for binding onto the regulatory 

sequences of the g35030 gene and sequences encoding the activator domain of a yeast 
transcription factor such as GAL4. The recombinant yeast cells are plated in a culture broth for 
selecting cells expressing the reporter sequence. The recombinant yeast cells thus selected 
contain a fusion protein that is able to bind onto the target regulatory sequence of the g35030 

20 gene. Then, the cDNAs encoding the fusion proteins are sequenced and may be cloned into 

expression or transcription vectors m vitro. The binding of the encoded polypeptides to the 
target regulatory sequences of the g3503O gene may be confirmed by techniques familiar to the 
one skilled in the art, such as gel retardation assays or DNAse protection assays. 

Gel retardation assays may also be performed independently in order to screen 

25 candidate molecules that are able to interact with the regulatory sequences of the g3503O gene, 

such as described by Fried and Crothers (1981), Garner and Revzin (1981) and Dent and 
Latchman (1 993). These techniques are based on the principle according to which a DNA 
fragment which is bound to a protein migrates slower than the same unbound DNA fragment. 
Briefly, the target nucleotide sequence is labeled. Then the labeled target nucleotide sequence 

30 is brought into contact with either a total nuclear extract from celts containing transcription 

factors, or with different candidate molecules to be tested. The interaction between the target 
regulatory sequence of the g35030 gene and the candidate molecule or the transcription factor is 
detected after gel or capillary electrophoresis through a retardation in the migration. 



Method For Screening Ligands That Modulate The Expression Of The g35030 
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Gene 

Another subject of the present invention is a method for screening molecules that 
modulate the expression of the g35G30 protein. Such a screening method comprises the steps 
of: 

5 a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a 

nucleotide sequence encoding the g35030 protein or a variant or a fragment thereof, placed 
under the control of its own promoter; 

b) bringing into contact the cultivated cell with a molecule to be tested; 

c) quantifying the expression of the g35030 protein or a variant or a fragment thereof. 
10 In an embodiment, the nucleotide sequence encoding the g35030 protein or a variant or 

a fragment thereof comprises an allele of at least one g35Q30 related bialleiic marker. 

Using DNA recombination techniques well known by the one skill in the art, the 
g35030 protein encoding DNA sequence is inserted into an expression vector, downstream from 
its promoter sequence. As an illustrative example, the promoter sequence of the g35030 gene is 

1 5 contained in the nucleic acid of the 5' regulatory region. 

The quantification of the expression of the g35G30 protein may be realized either at the 
mRNA level or at the protein level. In the latter case, polyclonal or monoclonal antibodies may 
be used to quantify the amounts of the g35030 protein that have been produced, for example in 
an ELISA or a RIA assay. 

20 In a preferred embodiment, the quantification of the g35030 mRNA is realized by a 

quantitative PCR amplification of the cDNA obtained by a reverse transcription of the total 
mRNA of the cultivated g35030 -transfected host cell, using a pair of primers specific for 
g35030. 

The present invention also concerns a method for screening substances or molecules 
25 that are able to increase, or in contrast to decrease, the level of expression of the g35030 gene. 

Such a method may allow the one skilled in the art to select substances exerting a regulating 
effect on the expression level of the g35030 gene and which may be useful as active ingredients 
included in pharmaceutical compositions for treating patients suffering from diseases. 

Thus, is also part of the present invention a method for screening of a candidate 
30 substance or molecule that modulated the expression of the g35030 gene, this method comprises 

the following steps: 

- providing a recombinant cell host containing a nucleic acid, wherein said nucleic acid 
comprises a nucleotide sequence of the 5' regulatory region or a biologically active fragment or 
variant thereof located upstream a polynucleotide encoding a detectable protein; 
35 - obtaining a candidate substance; and 
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- determining the ability of the candidate substance to modulate the expression levels of 
the polynucleotide encoding the detectable protein. 

In a further embodiment, the nucleic acid comprising the nucleotide sequence of the 5' 
regulatory region or a biologically active fragment or variant thereof also includes a 5'UTR 
5 region of the g35030 cDNA of SEQ ID No 2 to 1 7, or one of its biologically active fragments or 

variants thereof. 

Among the preferred polynucleotides encoding a detectable protein, there may be cited 
polynucleotides encoding beta galactosidase, green fluorescent protein (GFP) and 
chloramphenicol acetyl transferase (CAT). 

10 The invention also pertains to kits useful for performing the herein described screening 

method. Preferably, such kits comprise a recombinant vector that allows the expression of a 
nucleotide sequence of the 5' regulatory region or a biologically active fragment or variant 
thereof located upstream and operably linked to a polynucleotide encoding a detectable protein 
or the g35030 protein or a fragment or a variant thereof. 

1 5 In another embodiment of a method for the screening of a candidate substance or 

molecule that modulates the expression of the g3503O gene, wherein said method comprises the 
following steps: 

a) providing a recombinant host cell containing a nucleic acid, wherein said nucleic acid 
comprises a 5'UTR sequence of a g35030 cDNA, preferably of a g35030, or one of its 

20 biologically active fragments or variants, the 5'UTR sequence or its biologically active 

fragment or variant being operably linked to a polynucleotide encoding a detectable protein; 

b) obtaining a candidate substance; and 

c) determining the ability of the candidate substance to modulate the expression levels 
of the polynucleotide encoding the detectable protein. 

25 In a specific embodiment of the above screening method, the nucleic acid that 

comprises a nucleotide sequence selected from the group consisting of the 5'UTR sequence of a 
g35030 cDNA, preferably of a g35O30 or one of its biologically active fragments or variants, 
includes a promoter sequence which is endogenous with respect to the g35030 5'UTR sequence. 
In another specific embodiment of the above screening method, the nucleic acid that 

30 comprises a nucleotide sequence selected from the group consisting of the 5'UTR sequence of a 

g35030 cDNA or one of its biologically active fragments or variants, includes a promoter 
sequence which is exogenous with respect to the g35O30 5'UTR sequence defined therein. 

In a further preferred embodiment, the nucleic acid comprising the 5'-UTR sequence of 
a g35O30 cDNA or the biologically active fragments thereof includes a g3503O-related biallelic 

35 marker. 
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The invention further comprises a kit for the screening of a candidate substance 
modulating the expression of the g35030 gene, wherein said kit comprises a recombinant vector 
that comprises a nucleic acid including a 5'UTR sequence of the g35030 cDNA of SEQ ID Nos 
2 to 1 7, or one of their biologically active fragments or variants, the 5'UTR sequence or its 
5 biologically active fragment or variant being operably linked to a polynucleotide encoding a 

detectable protein. 

For the design of suitable recombinant vectors useful for performing the screening 
methods described above, it will be referred to the section of the present specification wherein 
the preferred recombinant vectors of the invention are detailed. 

10 Expression levels and patterns of g35O30 may be analyzed by solution hybridization 

with long probes as described in International Patent Application No. WO 97/05277. Briefly, 
the g35030 cDNA or the g35030 genomic DNA described above, or fragments thereof, is 
inserted at a cloning site immediately downstream of a bacteriophage (T3, T7 or SP6) RNA 
polymerase promoter to produce antisense RNA. Preferably, the g3503O insert comprises at 

1 5 least 100 or more consecutive nucleotides of the genomic DNA sequence or the cDNA 

sequences. The plasmid is linearized and transcribed in the presence of ribonucleotides 
comprising modified ribonucleotides (i.e. biotin-UTP and DIG-UTP). An excess of this doubly 
labeled RNA is hybridized in solution with mRNA isolated from cells or tissues of interest. The 
hybridization is performed under standard stringent conditions (40-5 0°C for 16 hours in an 80% 

20 formamide, 0. 4 M NaCI buffer, pH 7-8). The unhybridized probe is removed by digestion 

with ribonucleases specific for single-stranded RNA (i.e. RNases CL3, Tl , Phy M, U2 or A). 
The presence of the biotin-UTP modification enables capture of the hybrid on a microtitration 
plate coated with streptavidin. The presence of the DIG modification enables the hybrid to be 
detected and quantified by ELISA using an anti-DIG antibody coupled to alkaline phosphatase. 

25 Quantitative analysis of g35030 gene expression may also be performed using arrays. 

As used herein, the term array means a one dimensional, two dimensional, or multidimensional 
arrangement of a plurality of nucleic acids of sufficient length to permit specific detection of 
expression of mRNAs capable of hybridizing thereto. For example, the arrays may contain a 
plurality of nucleic acids derived from genes whose expression levels are to be assessed. The 

30 arrays may include the g35030 genomic DNA, the g35030 cDNA sequences or the sequences 

complementary thereto or fragments thereof, particularly those comprising at least one of the 
bialleiic markers according the present invention. Preferably, the fragments are at least 15 
nucleotides in length. In other embodiments, the fragments are at least 25 nucleotides in length. 
In some embodiments, the fragments are at least 50 nucleotides in length. More preferably, the 

35 fragments are at least 100 nucleotides in length. In another preferred embodiment, the 
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fragments are more than 1 00 nucleotides in length. In some embodiments the fragments may be 

more than 500 nucleotides in length. 

For example, quantitative analysis of g35030 gene expression may be performed with a 

complementary DNA microarray as described by Schena et al.(1995 and 1996). Full length 
5 g35030 cDNAs or fragments thereof are amplified by PCR and arrayed from a 96-welI 

microtiter plate onto silylated microscope slides using high-speed robotics. Printed arrays are 

incubated in a humid chamber to allow rehydration of the array elements and rinsed, once in 0. 

2% SDS for 1 min, twice in water for 1 min and once for 5 min in sodium borohydride solution. 

The arrays are submerged in water for 2 min at 95°C, transferred into 0. 2% SDS for 1 min, 
1 0 rinsed twice with water, air dried and stored in the dark at 25°C. 

Cell or tissue mRNA is isolated or commercially obtained and probes are prepared by a 

single round of reverse transcription. Probes are hybridized to 1 cm 2 microarrays under a 14 x 

14 mm glass coverslip for 6-12 hours at 60°C. Arrays are washed for 5 min at 25°C in low 

stringency wash buffer (1 x SSC/0. 2% SDS), then for 10 min at room temperature in high 
15 stringency wash buffer (0. 1 x SSC/0. 2% SDS). Arrays are scanned in 0. 1 x SSC using a 

fluorescence laser scanning device fitted with a custom filter set. Accurate differential 

expression measurements are obtained by taking the average of the ratios of two independent 

hybridizations. 

Quantitative analysis of g35030 gene expression may also be performed with full length 
20 g35030 cDNAs or fragments thereof in complementary DNA arrays as described by Pietu et 

al,(l 996). The full length g35030 cDNA or fragments thereof is PCR amplified and spotted on 
membranes. Then, mRNAs originating from various tissues or cells are labeled with 
radioactive nucleotides. After hybridization and washing in controlled conditions, the 
hybridized mRNAs are detected by phospho-imaging or autoradiography. Duplicate 
25 experiments are performed and a quantitative analysis of differentially expressed mRNAs is 

then performed. 

Alternatively, expression analysis using the g35030 genomic DNA, the g35030 cDNA, 
or fragments thereof can be done through high density nucleotide arrays as described by 
Lockhart et al.(1996) and Sosnowsky et al.(1997). Oligonucleotides of 15-50 nucleotides from 

30 the sequences of the g35030 genomic DNA, the g35030 cDNA sequences particularly those 

comprising at least one of biallelic markers according the present invention, or the sequences 
complementary thereto, are synthesized directly on the chip (Lockhart et al., supra) or 
synthesized and then addressed to the chip (Sosnowski et al., supra). Preferably, the 
oligonucleotides are about 20 nucleotides in length. 

35 g35030 cDNA probes labeled with an appropriate compound, such as biotin, 
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digoxigenin or fluorescent dye, are synthesized from the appropriate mRNA population and 
then randomly fragmented to an average size of 50 to 1 00 nucleotides. The said probes are then 
hybridized to the chip. After washing as described in Lockhart et al. s supra and application of 
different electric fields (Sosnowsky et aL, 1997)., the dyes or labeling compounds are detected 
5 and quantified. Duplicate hybridizations are performed. Comparative analysis of the intensity 

of the signal originating from cDNA probes on the same target oligonucleotide in different 
cDNA samples indicates a differential expression of g35030 mRNA. 

Methods For Inhibiting The Expression Of A g35030 Gene 
Other therapeutic compositions according to the present invention comprise 
10 advantageously an oligonucleotide fragment of the nucleic sequence of g35030 as an antisense 

tool or a triple helix tool that inhibits the expression of the corresponding g35030 gene. A 
preferred fragment of the nucleic sequence of g35030 comprises an allele of at least one of the 
biallelic markers of the invention. 

Antisense Approach 

1 5 Preferred methods using antisense polynucleotide according to the present invention are 

the procedures described by Sczakiel et al.(1995). 

Preferably, the antisense tools are chosen among the polynucleotides (15-200 bp long) 
that are complementary to the 5'end of the g35030 mRNA. In another embodiment, a 
combination of different antisense polynucleotides complementary to different parts of the 

20 desired targeted gene are used. 

Preferred antisense polynucleotides according to the present invention are 
complementary to a sequence of the mRNAs of g35030 that contains either the translation 
initiation codon ATG or a splicing donor or acceptor site. 

The antisense nucleic acids should have a length and melting temperature sufficient to 

25 permit formation of an intracellular duplex having sufficient stability to inhibit the expression of 

the g35030 mRNA in the duplex. Strategies for designing antisense nucleic acids suitable for 
use in gene therapy are disclosed in Green et aL, (1986) and Izant and Weintraub, (1984). 

In some strategies, antisense molecules are obtained by reversing the orientation of the 
g35030 coding region with respect to a promoter so as to transcribe the opposite strand from 

30 that which is normally transcribed in the cell. The antisense molecules may be transcribed 

using in vitro transcription systems such as those which employ T7 or SP6 polymerase to 
generate the transcript. Another approach involves transcription of g35030 antisense nucleic 
acids in vivo by operably linking DN A containing the antisense sequence to a promoter in a 
suitable expression vector. 
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Alternatively, suitable antisense strategies are those described by Rossi et al.(1991), in 
the International Applications Nos. WO 94/23026, WO 95/04141, WO 92/18522 and in the 
European Patent Application No. EP 0 572 287 A2 

An alternative to the antisense technology that is used according to the present 
5 invention comprises using ribozymes that will bind to a target sequence via their 

complementary polynucleotide tail and that will cleave the corresponding RNA by hydrolyzing 
its target site (namely "hammerhead ribozymes")- Briefly, the simplified cycle of a 
hammerhead ribozyme comprises (1) sequence specific binding to the target RNA via 
complementary antisense sequences; (2) site-specific hydrolysis of the cleavable motif of the 

10 target strand; and (3) release of cleavage products, which gives rise to another catalytic cycle. 

Indeed, the use of long-chain antisense polynucleotide (at least 30 bases long) or ribozymes 
with long antisense arms are advantageous. A preferred delivery system for antisense ribozyme 
is achieved by covalently linking these antisense ribozymes to lipophilic groups or to use 
liposomes as a convenient vector. Preferred antisense ribozymes according to the present 

1 5 invention are prepared as described by Sczakiel et al.( 1 995)* 

Triple Helix Approach 

The g35030 genomic DNA may also be used to inhibit the expression of the g35030 
gene based on intracellular triple helix formation. 

Triple helix oligonucleotides are used to inhibit transcription from a genome. They are 
20 particularly useful for studying alterations in cell activity when it is associated with a particular 

gene. 

Similarly, a portion of the g35O30 genomic DNA can be used to study the effect of 
inhibiting g3503O transcription within a cell Traditionally, homopurine sequences were 
considered the most useful for triple helix strategies. However, homopyrimidine sequences can 
25 also inhibit gene expression. Such homopyrimidine oligonucleotides bind to the major groove 

at homopurinerhomopyrimidine sequences. Thus, both types of sequences from the g35030 
genomic DNA are contemplated within the scope of this invention. 

To carry out gene therapy strategies using the triple helix approach, the sequences of 
the g3503O genomic DNA are first scanned to identify 1 0-mer to 20-mer homopyrimidine or 
30 homopurine stretches which could be used in triple-helix based strategies for inhibiting g35030 

expression. Following identification of candidate homopyrimidine or homopurine stretches, 
their efficiency in inhibiting g35030 expression is assessed by introducing varying amounts of 
oligonucleotides containing the candidate sequences into tissue culture cells which express the 
g35030gene. 
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The oligonucleotides can be introduced into the cells using a variety of methods known 
to those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE- 
Dextran, electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for altered cell function or reduced g3503O expression using 
5 techniques such as Northern blotting, RNase protection assays, or PCR based strategies to 

monitor the transcription levels of the g35030 gene in cells which have been treated with the 
oligonucleotide. 

The oligonucleotides which are effective in inhibiting gene expression in tissue culture 
cells may then be introduced in vivo using the techniques described above in the antisense 
10 approach at a dosage calculated based on the in vitro results, as described in antisense approach. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be 
replaced with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, 
an intercalating agent such as ethidium bromide, or the like, can be attached to the 3' end of the 
alpha oligonucleotide to stabilize the triple helix. For information on the generation of 
15 oligonucleotides suitable for triple helix formation see Griffin et al.(1989). 

Pharmaceutical Compositions And Formulations 

G35030-modulating Compounds 

Using the methods disclosed herein, compounds that selectively modulate g35030 activity 
20 in vitro and in vivo may be identified . The compounds identified by the process of the invention 

include, for example, antibodies having binding specificity for the §35030 peptide. It is also 
expected that homologues of g35030 may be useful for modulating g3503O-mediated activity and 
the related physiological condition associated with schizophrenia or bipolar disorder. Generally, it 
is further expected that assay methods of the present invention based on the role of g35030 in 
25 central nervous system disorder may be used to identify compounds capable of intervening in the 

assay cascade of the invention. 
Indications 

While g35030 has demonstrated an association with schizophrenia and bipolar disorder, 
indications involving g35030 may include various central nervous system disorders. Nervous 

30 system disorders are expected to have complex genetic bases and often share certain symptoms. In 

particular, as described herein, indications may include schizophrenia and other psychotic 
disorders, mood disorders, autism, substance dependence and alcoholism, mental retardation, and 
other psychiatric diseases including cognitive, anxiety, eating, impulse-control, and personality 
disorders, as defined with the Diagnosis and Statistical Manual of Mental Disorders fourth edition 

35 (DSM-IV) classification. 
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Pharmaceutical Formulations and Routes of Administration 

The compounds identified using the methods of the present invention can be administered 
to a mammal, including a human patient, alone or in pharmaceutical compositions where they are 
mixed with suitable carriers or excipient(s) at therapeutically effective doses to treat or ameliorate 
5 schizophrenia or bipolar disorder related disorders. A therapeutically effective dose further refers 

to that amount of the compound sufficient to result in amelioration of symptoms as determined by 
the methods described herein. Preferably, a therapeutically effective dosage is suitable for 
continued periodic use or administration. Techniques for formulation and administration of the 
compounds of the instant application may be found in "Remington's Pharmaceutical Sciences," 
. 10 Mack Publishing Co., Easton, PA, latest edition. 

Routes of Administration 

Suitable routes of administration include oral, rectal, transmucosal, or intestinal 
administration, parenteral delivery, including intramuscular, subcutaneous, intramedullary 
injections, as well as intrathecal, direct intraventricular, intravenous, intraperitoneal, intranasal or 
15 intraocular injections. A particularly useful method of administering compounds for treating 

central nervous system disease involves surgical implantation of a device for delivering the 
compound over an extended period of time. Sustained release formulations of the invented 
medicaments particularly are contemplated. 

Composition/Formulation 

20 Pharmaceutical compositions and medicaments for use in accordance with the present 

invention may be formulated in a conventional manner using one or more physiologically 
acceptable carriers comprising excipients and auxiliaries. Proper formulation is dependent upon 
the route of administration chosen. 

For injection, the agents of the invention may be formulated in aqueous solutions, 

25 preferably in physiologically compatible buffers such as Hanks's solution, Ringer's solution, or 

physiological saline buffer such as a phosphate or bicarbonate buffer. For transmucosal 
administration, penetrants appropriate to the barrier to be permeated are used in the formulation. 
Such penetrants are generally known in the art. 

Pharmaceutical preparations which can be used orally include push-fit capsules made of 

30 gelatin, as well as soft, sealed capsules made of gelatin and a plasticizer, such as glycerol or 

sorbitol. The push-fit capsules can contain the active ingredients in admixture with fillers such as 
lactose, binders such as starches, and/or lubricants such as talc or magnesium stearate and, 
optionally, stabilizers. In soft capsules, the active compounds may be dissolved or suspended in 
suitable liquids, such as fatty oils, liquid paraffin, or liquid polyethylene glycols. In addition, 

35 stabilizers may be added. All formulations for oral administration should be in dosages suitable 
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for such administration. 

For buccal administration,the compositions may take the form of tablets or lozenges 
formulated in conventional manner. 

For administration by inhalation, the compounds for use according to the present 
5 invention are conveniently delivered in the form of an aerosol spray presentation from pressurized 

packs or a nebulizer, with the use of a suitable gaseous propellant, e.g., carbon dioxide. In the 
case of a pressurized aerosol the dosage unit may be determined by providing a valve to deliver a 
metered amount. Capsules and cartridges of, e.g., gelatin, for use in an inhaler or insufflator, may 
be formulated containing a powder mix of the compound and a suitable powder base such as 
10 lactose or starch. 

The compounds may be formulated for parenteral administration by injection, e.g., by 
bolus injection or continuous infusion. Formulations for injection may be presented in unit 
dosage form, e.g., in ampoules or in multi-dose containers, with an added preservative. The 
compositions may take such forms as suspensions, solutions or emulsions in aqueous vehicles, 
1 5 and may contain formulatory agents such as suspending, stabilizing and/or dispersing agents. 

Pharmaceutical formulations for parenteral administration include aqueous solutions of 
the active compounds in water-soluble form. Aqueous suspensions may contain substances which 
increase the viscosity of the suspension, such as sodium carboxymethyl cellulose, sorbitol, or 
dextran. Optionally, the suspension may also contain suitable stabilizers or agents which increase 
20 the solubility of the compounds to allow for the preparation of highly concentrated solutions. 

Alternatively, the active ingredient may be in powder or lyophilized form for constitution 
with a suitable vehicle, such as sterile pyrogen-free water, before use. 

In addition to the formulations described previously, the compounds may also be 
formulated as a depot preparation. Such long acting formulations may be administered by 
25 implantation (for example subcutaneously or intramuscularly) or by intramuscular injection. 

Thus, for example, the compounds may be formulated with suitable polymeric or hydrophobic 
materials (for example as an emulsion in an acceptable oil) or ion exchange resins, or as sparingly 
soluble derivatives, for example, as a sparingly soluble salt 

Additionally, the compounds may be delivered using a sustained-release system, such as 
30 semipermeable matrices of solid hydrophobic polymers containing the therapeutic agent. Various 

sustained release materials have been established and are well known by those skilled in the art. 
Sustained-release capsules may, depending on their chemical nature, release the compounds for a 
few weeks up to over 1 00 days. 

Depending on the chemical nature and the biological stability of the therapeutic reagent, 
35 additional strategies for protein stabilization may be employed. 
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The pharmaceutical compositions also may comprise suitable solid or gel phase carriers 
or excipients. Examples of such carriers or excipients include but are not limited to calcium 
carbonate, calcium phosphate, various sugars, starches, cellulose derivatives, gelatin, and 
polymers such as polyethylene glycols. 
5 Effective Dosage 

Pharmaceutical compositions suitable for use in the present invention include 
compositions wherein the active ingredients are contained in an effective amount to achieve their 
intended purpose* More specifically, a therapeutically effective amount means an amount 
effective to prevent development of or to alleviate the existing symptoms of the subject being 

1 0 treated. Determination of the effective amounts is well within the capability of those skilled in the 

art, especially in light of the detailed disclosure provided herein. 

For any compound used in the method of the invention, the therapeutically effective dose 
can be estimated initially from cell culture assays, and a dose can be formulated in animal models. 
Such information can be used to more accurately determine useful doses in humans. 

15 A therapeutically effective dose refers to that amount of the compound that results in 

amelioration of symptoms in a patient. Toxicity and therapeutic efficacy of such compounds can 
be determined by standard pharmaceutical procedures in cell cultures or experimental animals, 
e.g., for determining the LD50, (the dose lethal to 50% of the test population) and the ED50 (the 
dose therapeutically effective in 50% of the population). The dose ratio between toxic and 

20 therapeutic effects is the therapeutic index and it can be expressed as the ratio between LDSO and 

EDSO. Compounds which exhibit high therapeutic indices are preferred. 

The data obtained from these cell culture assays and animal studies can be used in 
formulating a range of dosage for use in human. The dosage of such compounds lies preferably 
within a range of circulating concentrations that include the ED50, with little or no toxicity. The 

25 dosage may vary within this range depending upon the dosage form employed and the route of 

administration utilized. The exact formulation, route of administration and dosage can be chosen 
by the individual physician in view of the patient's condition. (See, e.g., Fingl et al.» 1975, in "The 
Pharmacological Basis of Therapeutics", Ch. 1). 

30 Computer-Related Embodiments 

As used herein the term "nucleic acid codes of the invention" encompass the nucleotide 
sequences comprising, consisting essentially of, or consisting of any one of the following: 

a) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 
150, 200, 500, 1000 or 2000 nucleotides of SEQIDNo. 1, and the complements thereof, 
35 wherein said contiguous span comprises at least one of the following nucleotide positions of 
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SEQ ID No 1: 201 123 to 247802 and 199122 to 249803; 

b) a contiguous span of at least 8, 12,15, 18,20,25,30,35,40,45,50,55,60,65,70, 
75, 80, 90, 100 or 200 nucleotides, to the extent that such a length is consistent with the 
particular sequence ID, of SEQ ID Nos. 2 to 17 or the complements thereof; 
5 c) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 1 00, 

150, 200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 24, 25, 28 and 74, and the 
complements thereof, to the extent that such a length is consistent with the particular sequence 
ID; 

d) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 

10 150,200,500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 79 to 132, and the complements 

thereof, to the extent that such a length is consistent with the particular sequence ID; 

e) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90 or 1 00 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
span comprises at least one of the following nucleotide positions of SEQ ID No 1 : 201 123 to 

15 201234, 201123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 

to 2 17077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 199122 to 201 122 
and 247803 to 249803; 

f) a contiguous span according to a), b), c), d) or e), wherein said span includes a 
20 bialleiic marker; 

g) a contiguous span according to a), b), c), d) or e), wherein said span includes a 
bialleiic marker selected from the group consisting of A13 to Al 8, A20 to A46, A49 to A52, 
A55, A57, A59 to A63, A72 to A73, A76 and A123 ; and 

h) a contiguous span according to f), wherein said span includes a bialleiic marker with 
25 the alternative allele present at said bialleiic marker; 

The "nucleic acid codes of the invention" further encompass nucleotide sequences 
homologous to a contiguous span of at least 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 200, 500, 
1000 or 2000 nucleotides, to the extent that such a length is consistent with the particular 
sequence of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132, and the complements thereof. 

30 . The "nucleic acid codes of the invention" also encompass nucleotide sequences homologous to 
a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 90 or 
100 nucleotides of SEQ ID No. 1 or the complements thereof, wherein said contiguous span 
comprises at least one of the following nucleotide positions of SEQ ID No. 1 ; 201 123 to 
201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 21 6836 

35 to 217077, 21 7671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
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240440 to 241 153, 241072 to 241291, 244353 to 244561, 246273 to 247802, 201 123 to 
247802, 199122 to 201 122, 247803 to 249803 and 199122 to 249803. 

Homologous sequences refer to a sequence having at least 99%, 98%, 97%, 96%, 
95%, 90%, 85%, 80%, or 75% homology to these contiguous spans. Homology may be 
5 determined using any method described herein, including BLAST2N with the default 

parameters or with any modified parameters. Homologous sequences also may include RNA 
sequences in which uridines replace the thymines in the nucleic acid codes of the invention. It 
will be appreciated that the nucleic acid codes of the invention can be represented in the 
traditional single character format (See the inside back cover of Stryer, Lubert. Biochemistry, 

rr! 

10 3 edition. W. H Freeman & Co., New York.) or in any other format or code which records 

the identity of the nucleotides in a sequence. 

As used herein the term "polypeptide codes of SEQ ID Nos. 1 8 to 23" encompasses the 
polypeptide sequence of SEQ ID Nos 1 8 to 23, polypeptide sequences homologous to the 
polypeptides of SEQ ID Nos. 1 8 to 23, or fragments of any of the preceding sequences. 

15 Homologous polypeptide sequences refer to a polypeptide sequence having at least 99%, 98%, 

97%, 96%, 95%, 90%, 85%, 80%, 75% homology to one of the polypeptide sequences of SEQ CD 
Nos. 1 8 to 23 . Homology may be determined using any of the computer programs and parameters 
described herein, including FASTA with the default parameters or with any modified parameters. 
The homologous sequences may be obtained using any of the procedures described herein or may 

20 result from the correction of a sequencing error as described above. The polypeptide fragments 

comprise at least 4, 6, 8, 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, or 150 consecutive amino acids of 
the polypeptides of SEQ ID Nos. 18 to 23. Preferably, the fragments are novel fragments. It will be 
appreciated that the polypeptide codes of the SEQ ID Nos. 1 8 to 23 can be represented in the 
traditional single character format or three letter format (See the inside back cover of Starrier, 

25 Lubert. Biochemistry, 3 rd edition. W. H Freeman & Co., New York.) or in any other format which 

relates the identity of the polypeptides in a sequence. 

It will be appreciated by those skilled in the art that the nucleic acid codes of SEQ ID 
Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 and polypeptide codes of SEQ ID Nos. 1 8 to 23 can 
be stored, recorded, and manipulated on any medium which can be read and accessed by a 

30 computer. As used herein, the words "recorded'* and "stored" refer to a process for storing 

information on a computer medium. A skilled artisan can readily adopt any of the presently known 
methods for recording information on a computer readable medium to generate embodiment 
comprising one or more of nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
132, or one or more of the polypeptide codes of SEQ ID Nos. 1 8 to 23. Another aspect of the 

35 present invention is a computer readable medium having recorded thereon at least 2, 5, 1 0, 15, 20, 
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25, 30, or 50 nucleic acid codes of SEQ ID Nos 1 to 17, 24, 25, 28, 74 and 79 to 132. Another 
aspect of the present invention is a computer readable medium having recorded thereon at least 2, 
5, 1 0, 1 5, 20, 25 7 30, or 50 polypeptide codes of SEQ ID Nos 1 8 to 23 . 

Computer readable media include magnetically readable media, optically readable media, 
5 electronically readable media and magnetic/optical media. For example, the computer readable 

media may be a hard disk, a floppy disk, a magnetic tape, CD-ROM, Digital Versatile Disk (DVD), 
Random Access Memory (RAM), or Read Only Memory (ROM) as well as other types of other 
media known to those skilled in the art. 

Embodiments of the present invention include systems, particularly computer systems 

1 0 which store and manipulate the sequence information described herein. One example of a 

computer system 100 is illustrated in block diagram form in Figure 12. As used herein, "a 
computer system" refers to the hardware components, software components, and data storage 
components used to analyze the nucleotide sequences of the nucleic acid codes of SEQ ID Nos 1 
to 17, 24, 25, 28, 74 and 79 to 132, or the amino acid sequences of the polypeptide codes of SEQ 

1 5 ID Nos. 1 8 to 23. In one embodiment, the computer system 100 is a Sun Enterprise 1 000 server 

(Sun Microsystems, Palo Alto, CA). The computer system 100 preferably includes a processor for 
processing, accessing and manipulating the sequence data. The processor 1 05 can be any well- 
known type of central processing unit, such as the Pentium IQ from Intel Corporation, or similar 
processor from Sun, Motorola, Compaq or International Business Machines. 

20 Preferably, the computer system 100 is a general purpose system that comprises the 

processor 1 05 and one or more internal data storage components 1 10 for storing data, and one or 
more data retrieving devices for retrieving the data stored on the data storage components. A 
skilled artisan can readily appreciate that any one of the currently available computer systems are 
suitable. 

25 In one particular embodiment, the computer system 1 00 includes a processor 1 05 

connected to a bus which is connected to a main memory 115 (preferably implemented as RAM) 
and one or more internal data storage devices 110, such as a hard drive and/or other computer 
readable media having data recorded thereon. In some embodiments, the computer system 100 
further includes one or more data retrieving device 1 1 8 for reading the data stored on the internal 

30 data storage devices 110. 

The data retrieving device 1 18 may represent, for example, a floppy disk drive, a compact 
disk drive, a magnetic tape drive, etc. In some embodiments, the internal data storage device 1 1 0 is 
a removable computer readable medium such as a floppy disk, a compact disk, a magnetic tape, etc. 
containing control logic and/or data recorded thereon. The computer system 100 may 

35 advantageously include or be programmed by appropriate software for reading the control logic 
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and/or the data from the data storage component once inserted in the data retrieving device. 

The computer system. 100 includes a display 120 which is used to display output to a 
computer user. It should also be noted that the computer system 1 00 can be linked to other 
computer systems 125a-c in a network or wide area network to provide centralized access to the 
5 computer system 100. 

Software for accessing and processing the nucleotide sequences of the nucleic acid codes 
ofSEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132, or the amino acid sequences of the 
polypeptide codes of SEQ ID Nos. 18 to 23 (such as search tools, compare tools, and modeling 
tools etc.) may reside in main memory 115 during execution. 

1 0 In some embodiments, the computer system 1 00 may further comprise a sequence 

comparer for comparing the above-described nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 
28, 74 and 79 to 132 or polypeptide codes of SEQ ID "Nos. 1 8 to 23 stored on a computer readable 
medium to reference nucleotide or polypeptide sequences stored on a computer readable medium. 
A "sequence comparer" refers to one or more programs which are implemented on the computer 

1 5 system 100 to compare a nucleotide or polypeptide sequence with other nucleotide or polypeptide 

sequences and/or compounds including but not limited to peptides, peptidomimetics, and chemicals 
stored within the data storage means. For example, the sequence comparer may compare the 
nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 
1 32, or the amino acid sequences of the polypeptide codes of SEQ ID Nos. 1 8 to 23 stored on a 

20 computer readable medium to reference sequences stored on a computer readable medium to 

identify homologies, motifs implicated in biological function, or structural motifs. The various 
sequence comparer programs identified elsewhere in this patent specification are particularly 
contemplated for use in this aspect of the invention. 

Figure 1 3 is a flow diagram illustrating one embodiment of a process 200 for comparing a 

25 new nucleotide or protein sequence with a database of sequences in order to determine the 

homology levels between the new sequence and the sequences in the database. The database of 
sequences can be a private database stored within the computer system 1 00, or a public database 
such as GENBANK, PIR OR SWISSPROT that is available through the Internet. 

The process 200 begins at a start state 201 and then moves to a state 202 wherein the new 

3 0 sequence to be compared is stored to a memory in a computer system 1 00. As discussed above, the 

memory could be any type of memory, including RAM or an internal storage device. 

The process 200 then moves to a state 204 wherein a database of sequences is opened for 
analysis and comparison. The process 200 then moves to a state 206 wherein the first sequence 
stored in the database is read into a memory on the computer. A comparison is then performed at a 

3 5 state 210 to determine if the first sequence is the same as the second sequence. It is important to 
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note that this step is not limited to performing an exact comparison between the new sequence and 
the first sequence in the database. Well-known methods are known to those of skill in the art for 
comparing two nucleotide or protein sequences, even if they are not identical. For example, gaps 
can be introduced into one sequence in order to raise the homology level between the two tested 
5 sequences. The parameters that control whether gaps or other features are introduced into a 

sequence during comparison are normally entered by the user of the computer system. 

Once a comparison of the two sequences has been performed at the state 210, a 
determination is made at a decision state 210 whether the two sequences are the same. Of course, 
the term "same" is not limited to sequences that are absolutely identical. Sequences that are within 

10 the homology parameters entered by the user will be marked as "same" in the process 200. 

If a determination is made that the two sequences are the same, the process 200 moves to a 
state 214 wherein the name of the sequence from the database is displayed to the user. This state 
notifies the user that the sequence with the displayed name fulfills the homology constraints that 
were entered. Once the name of the stored sequence is displayed to the user, the process 200 

1 5 moves to a decision state 2 1 8 wherein a determination is made whether more sequences exist in the 

database. If no more sequences exist in the database, then the process 200 terminates at an end 
state 220. However; if more sequences do exist in the database, then the process 200 moves to a 
state 224 wherein a pointer is moved to the next sequence in the database so that it can be 
compared to the new sequence. In this manner, the new sequence is aligned and compared with 

20 every sequence in the database. 

It should be noted that if a determination had been made at the decision state 2 1 2 that the 
sequences were not homologous, then the process 200 would move immediately to the decision 
state 2 1 8 in order to determine if any other sequences were available in the database for 
comparison. 

25 Accordingly, one aspect of the present invention is a computer system comprising a 

processor, a data storage device having stored thereon a nucleic acid code of SEQ ID NOs. 1 to 
17, 24, 25, 28, 74 and 79 to 132 or a polypeptide code of SEQ IDNos 18 to 23, a data storage 
device having retrievably stored thereon reference nucleotide sequences or polypeptide 
sequences to be compared to the nucleic acid code of SEQ ID Nos. 1 to 17, 24> 25, 28, 74 and 

30 79 to 132 or polypeptide code of SEQ ID Nos. 1 8 to 23 and a sequence comparer for conducting 

the comparison. The sequence comparer may indicate a homology level between the sequences 
compared or identify structural motifs in the above described nucleic acid code of SEQ ID Nos. 
1 to 1 7, 24, 25, 28, 74 and 79 to 132 and polypeptide codes of SEQ ID Nos. 1 8 to 23, or it may 
identify structural motifs in sequences which are compared to these nucleic acid codes and 

35 polypeptide codes. In some embodiments, the data storage device may have stored thereon the 
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sequences of at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of SEQ ID Nos. 1 to 
17, 24, 25, 28, 74 and 79 to 132 or polypeptide codes of SEQ ID Nos. 18 to 23. 

Another aspect of the present invention is a method for determining the level of homology 
between a nucleic acid code of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 and a reference 
5 nucleotide sequence, comprising the steps of reading the nucleic acid code and the reference 

nucleotide sequence through the use of a computer program which determines homology levels and 
determining homology between the nucleic acid code and the reference nucleotide sequence with 
the computer program. The computer program may be any of a number of computer programs for 
determining homology levels, including those specifically enumerated herein, including BLAST2N 

1 0 with the default parameters or with any modified parameters. The method may be implemented 

using the computer systems described above. The method may also be performed by reading 2, 5, 
10, 15, 20, 25, 30, or 50 of the above described nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 132 through use of the computer program and determining homology 
between the nucleic acid codes and reference nucleotide sequences . 

15 Figure 14 is a flow diagram illustrating one embodiment of a process 250 in a computer 

for determining whether two sequences are homologous. The process 250 begins at a start state 
252 and then moves to a state 254 wherein a first sequence to be compared is stored to a 
memory. The second sequence to be compared is then stored to a memory at a state 256. The 
process 250 then moves to a state 260 wherein the first character in the first sequence is read 

20 and then to a state 262 wherein the first character of the second sequence is read. It should be 

understood that if the sequence is a nucleotide sequence, then the character would normally be 
either A, T, C, G or U. If the sequence is a protein sequence, then it should be in the single 
letter amino acid code so that the first and sequence sequences can be easily compared. 

A determination is then made at a decision state 264 whether the two characters are the 

25 same* If they are the same, then the process 250 moves to a state 268 wherein the next 

characters in the first and second sequences are read. A determination is then made whether 
the next characters are the same. If they are, then the process 250 continues this loop until two 
characters are not the same. If a determination is made that the next two characters are not the 
same, the process 250 moves to a decision state 274 to determine whether there are any more 

30 characters either sequence to read. 

If there aren't any more characters to read, then the process 250 moves to a state 276 
wherein the level of homology between the first and second sequences is displayed to the user. 
The level of homology is determined by calculating the proportion of characters between the 
sequences that were the same out of the total number of sequences in the first sequence. Thus, 

35 if every character in a first 100 nucleotide sequence aligned with a every character in a second 
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sequence, the homology level would be 1 00%. 

i Alternatively, the computer program may be a computer program which compares the 
nucleotide sequences of the nucleic acid codes of the present invention, to reference nucleotide 
sequences in order to determine whether the nucleic acid code of SEQ ID Nos. 1 to 17, 24, 25, 28, 
5 74 and 79 to 132 differs from a reference nucleic acid sequence at one or more positions. 

Optionally such a program records the length and identity of inserted, deleted or substituted 
nucleotides with respect to the sequence of either the reference polynucleotide or the nucleic acid 
code of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32. In one embodiment, the computer 
program may be a program which determines whether the nucleotide sequences of the nucleic acid 
1 0 codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 132 contain a bialielic marker or single 

nucleotide polymorphism (SNP) with respect to a reference nucleotide sequence. This single 
nucleotide polymorphism may comprise a single base substitution, insertion, or deletion, while 
this bialielic marker may comprise abour one to ten consecutive bases substituted, inserted or 
deleted. 

1 5 Another aspect of the present invention is a method for determining the level of homology 

between a polypeptide code of SEQ ID Nos, 1 8 to 23 and a reference polypeptide sequence, 
comprising the steps of reading the polypeptide code of SEQ ID Nos. 18 to 23 and the reference 
polypeptide sequence through use of a computer program which determines homology levels and 
determining homology between the polypeptide code and the reference polypeptide sequence using 

20 the computer program. 

Accordingly, another aspect of the present invention is a method for determining whether a 
nucleic acid code of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 differs at one or more 
nucleotides from a reference nucleotide sequence comprising the steps of reading the nucleic acid 
code and the reference nucleotide sequence through use of a computer program which identifies 

25 differences between nucleic acid sequences and identifying differences between the nucleic acid 

code and the reference nucleotide sequence with the computer program. In some embodiments, the 
computer program is a program which identifies single nucleotide polymorphisms. The method 
may be implemented by the computer systems described above and the method illustrated in Figure 
14. The method may also be performed by reading at least 2, 5, 10, 15, 20, 25, 30, or 50 of the 

30 nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 and the reference 

nucleotide sequences through the use of the computer program and identifying differences between 
the nucleic acid codes and the reference nucleotide sequences with the computer program. 

In other embodiments the computer based system may further comprise an identifier for 
identifying features within the nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 

35 1 7, 24, 25, 28, 74 and 79 to 132 or the amino acid sequences of the polypeptide codes of SEQ ID 
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Nos. 18 to 23. 

An "identifier" refers to one or more programs which identifies certain features within 
the above-descri bed nucleotide sequences of the nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 
25, 28, 74 and 79 to 1 32 or the amino acid sequences of the polypeptide codes of SEQ ID Nos. 

5 18 to 23. In one embodiment, the identifier may comprise a program which identifies an open 

reading frame in the cDNAs codes of SEQ ID Nos 2 to 17 and 36 to 40. 

Figure 15 is a flow diagram illustrating one embodiment of an identifier process 300 for 
detecting the presence of a feature in a sequence. The process 300 begins at a start state 302 
and then moves to a state 304 wherein a first sequence that is to be checked for features is 

1 0 stored to a memory 1 1 5 in the computer system 1 00. The process 300 then moves to a state 306 

wherein a database of sequence features is opened. Such a database would include a list of each 
feature's attributes along with the name of the feature. For example, a feature name could be 
"Initiation Codon" and the attribute would be "ATG". Another example would be the feature 
name "TAATAA Box" and the feature attribute would be "TAATAA", An example of such a 

1 5 database is produced by the University of Wisconsin Genetics Computer Group 

(www-gcg.com). 

Once the database of features is opened at the state 306, the process 300 moves to a 
state 308 wherein the first feature is read from the database. A comparison of the attribute of 
the first feature with the first sequence is then made at a state 3 10. A determination is then 
20 made at a decision state 316 whether the attribute of the feature was found in the first sequence. 

If the attribute was found, then the process 300 moves to a state 318 wherein the name of the 
found feature is displayed to the user. 

The process 300 then moves to a decision state 320 wherein a determination is made 
whether move features exist in the database. If no more features do exist, then the process 300 
25 terminates at an end state 324. However, if more features do exist in the database, then the 

process 300 reads the next sequence feature at a state 326 and loops back to the state 310 
wherein the attribute of the next feature is compared against the first sequence. 

It should be noted, that if the feature attribute is not found in the first sequence at the 
decision state 316, the process 300 moves directly to the decision state 320 in order to 
30 determine if any more features exist in the database. 

In another embodiment, the identifier may comprise a molecular modeling program 
which determines the 3-dimensional structure of the polypeptides codes of SEQ ID Nos. 18 to 
23. In some embodiments, the molecular modeling program identifies target sequences that are 
most compatible with profiles representing the structural environments of the residues in known 
35 three-dimensional protein structures. (See, e.g., Eisenberg et al., U.S. Patent No. 5,436,850 
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issued July 25, 1995). In another technique, the known three-dimensional structures of proteins 
in a given family are superimposed to define the structurally conserved regions in that family. 
This protein modeling technique also uses the known three-dimensional structure of a 
homologous protein to approximate the structure of the polypeptide codes of SEQ ID Nos. 4 to 
5 8. (See e.g., Srinivasan, et al., U.S. Patent No; 5,557,535 issued September 17, 1996). 

Conventional homology modeling techniques have been used routinely to build models of 
proteases and antibodies. (Sowdhamini et al, Protein Engineering 10:207, 215 (1997)). 
Comparative approaches can also be used to develop three-dimensional protein models when 
the protein of interest has poor sequence identity to template proteins. In some cases, proteins 

1 0 fold into similar three-dimensional structures despite having very weak sequence identities. For 

example, the three-dimensional structures of a number of helical cytokines fold in similar three- 
dimensional topology in spite of weak sequence homology. 

The recent development of threading methods now enables the identification of likely 
folding patterns in a number of situations where the structural relatedness between target and 

15 template(s) is not detectable at the sequence level. Hybrid methods, in which fold recognition is 

performed using Multiple Sequence Threading (MST), structural equivalencies are deduced 
from the threading output using a distance geometry program DRAGON to construct a low 
resolution model, and a full-atom representation is constructed using a molecular modeling 
package such as QUANTA. 

20 According to this 3 -step approach, candidate templates are first identified by using the 

novel fold recognition algorithm MST, which is capable of performing simultaneous threading 
of multiple aligned sequences onto one or more 3-D structures. In a second step, the structural 
equivalencies obtained from the MST output are converted into interresidue distance restraints 
and fed into the distance geometry program DRAGON, together with auxiliary information 

25 obtained from secondary structure predictions. The program combines the restraints in an 

unbiased manner and rapidly generates a large number of low resolution model confirmations. 
In a third step, these low resolution model confirmations are converted into full-atom models 
and subjected to energy minimization using the molecular modeling package QUANTA. (See 
e.g., Aszodi et al., Proteins: Structure, Function, and Genetics, Supplement 1:38-42 (1997)). 

30 The results of the molecular modeling analysis may then be used in rational drug design 

techniques to identify agents which modulate the activity of the polypeptide codes of SEQ ID 
Nos. 18 to 23. 

Accordingly, another aspect of the present invention is a method of identifying a feature 
within the nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the 
35 polypeptide codes of SEQ ID Nos. 1 8 to 23 comprising reading the nucleic acid code(s) or the 
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polypeptide code(s) through the use of a computer program which identifies features therein and 
identifying features within the nucleic acid code(s) or polypeptide code(s) with the computer 
program. In one embodiment, computer program comprises a computer program which 
identifies open reading frames. In a further embodiment, the computer program identifies 
5 structural motifs in a polypeptide sequence. In another embodiment, the computer program 

comprises a molecular modeling program. The method may be performed by reading a single 
sequence or at least 2, 5, 10, 15, 20, 25, 30, or 50 of the nucleic acid codes of SEQ ID Nos. 1 to 
17, 24, 25, 28, 74 and 79 to 132 or the polypeptide codes of SEQ ID Nos, 18 to 23 through the 
use of the computer program and identifying features within the nucleic acid codes or 

10 polypeptide codes with the computer program. 

The nucleic acid codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32 or the 
polypeptide codes of SEQ ID Nos. 18 to 23 may be stored and manipulated in a variety of data 
processor programs in a variety of formats. For example, the nucleic acid codes of SEQ ID Nos. 1 
to 17, 24, 25, 28, 74 and 79 to 132 or the polypeptide codes of SEQ ID Nos, 18 to 23 may be 

1 5 stored as text in a word processing file, such as MicrosoftWORD or WORDPERFECT or as an 

ASCII file in a variety of database programs familiar to those of skill in the art, such as DB2, 
SYBASE, or ORACLE. In addition, many computer programs and databases may be used as 
sequence comparers, identifiers, or sources of reference nucleotide or polypeptide sequences to be 
compared to the nucleic acid codes of SEQ ID Nos. 1 to 17, 24, 25, 28, 74 and 79 to 132 or the 

20 polypeptide codes of SEQ ID Nos. .18 to 23. The following list is intended not to limit the 

invention but to provide guidance to programs and databases which are useful with the nucleic acid 
codes of SEQ ID Nos. 1 to 1 7, 24, 25, 28, 74 and 79 to 1 32 or the polypeptide codes of SEQ ID 
Nos. 1 8 to 23 ♦ The programs and databases which may be used include, but are not limited to: 
MacPattern (EMBL), DiscoveryBase (Molecular Applications Group), GeneMine (Molecular 

25 Applications Group), Look (Molecular Applications Group), MacLook (Molecular Applications 

Group), BLAST and BLAST2 (NCBI), BLASTN and BLASTX (Altschul et al, J. Mol Biol 215: 
403 (1990)), FASTA (Pearson and Lipman, Proa Natl. Acad Sci. USA, 85: 2444 (1988)), 
FASTDB (Brutlag et al, Comp. App. Biosci. 6:237-245, 1990), Catalyst (Molecular Simulations 
Inc.), Catalyst/SHAPE (Molecular Simulations Inc.), Cerius 2 .DBAccess (Molecular Simulations 

30 Inc.), HypoGen (Molecular Simulations Inc.), Insight II, (Molecular Simulations Inc.), Discover 

(Molecular Simulations Inc.), CHARMm (Molecular Simulations Inc.), Felix (Molecular 
Simulations Inc.), DelPhi, (Molecular Simulations Inc.), QuanteMM, (Molecular Simulations Inc.), 
Homology (Molecular Simulations Inc.), Modeler (Molecular Simulations Inc.), ISIS (Molecular 
Simulations Inc.), Quanta/Protein Design (Molecular Simulations Inc.), WebLab (Molecular 

35 Simulations Inc.), WebLab Diversity Explorer (Molecular Simulations Inc.), Gene Explorer 
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(Molecular Simulations Inc.), SeqFold (Molecular Simulations Inc.), the EMBL/Swissprotein 
database, the MDL Available Chemicals Directory database, the MDL Drug Data Report data base, 
the Comprehensive Medicinal Chemistry database, Derwettts's World Drug Index database, the 
BioByteMasterFile database, the Genbank database, and the Genseqn database. Many other 
5 programs and data bases would be apparent to one of skill in the art given the present disclosure. 

Motifs which may be detected using the above programs include sequences encoding 
leucine zippers, helix-turn-helix motifs, glycosylation sites, ubiquitination sites, alpha helices, 
and beta sheets, signal sequences encoding signal peptides which direct the secretion of the 
encoded proteins, sequences implicated in transcription regulation such as homeoboxes, acidic 

10 stretches, enzymatic active sites, substrate binding sites, and enzymatic cleavage sites. 

Throughout this application, various publications, patents, and published patent 
applications are cited. The disclosures of the publications, patents, and published patent 
specifications referenced in this application are all hereby incorporated by reference in their 
entireties into the present disclosure to more fully describe the state of the art to which this 

15 invention pertains. 

EXAMPLES 

Several of the methods of the present invention are described in the following 
examples, which are offered by way of illustration and not by way of limitation. Many other 
20 modifications and variations of the invention as herein set forth can be made without departing 

from the spirit and scope thereof and therefore only such limitations should be imposed as are 
indicated by the appended claims. 

Example 1 

25 Identification Of Biallelic Markers - DNA Extraction 

Donors were unrelated and healthy. They presented a sufficient diversity for being 
representative of a heterogeneous population. The DNA from 100 individuals was extracted 
and tested for the detection of the biallelic markers. 

30 ml of peripheral venous blood were taken from each donor in the presence of EDTA. 
30 Cells (pellet) were collected after centrifugation for 10 minutes at 2000 rpm. Red cells were 

lysed by a lysis solution (50 ml final volume: 10 mM Tris pH7.6; 5 mM MgCl2; 10 mMNaCl). 
The solution was centrifuged (1 0 minutes, 2000 rpm) as many times as necessary to eliminate 
the residual red cells present in the supernatant, after resuspension of the pellet in the lysis 
solution. 

35 The pellet of white cells was lysed overnight at 42°C with 3.7 ml of lysis solution 
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composed of: 

- 3 ml TE 1 0-2 (Tris-HCl 1 0 mM, EDTA 2 mM) / NaCI 0 4 M 
-200 |xl SDS 10% 

- 500 fxl K-proteinase (2 mg K-proteinase in TE 1 0-2 / NaCI 0.4 M). 

5 

For the extraction of proteins, 1 ml saturated NaCI (6M) (1/3.5 v/v) was added. After 
vigorous agitation, the solution was centrifuged for 20 minutes at 10000 rpm. 

For the precipitation of DNA, 2 to 3 volumes of 1 00% ethanol were added to the 
previous supernatant, and the solution was centrifuged for 30 minutes at 2000 rpm. The DNA 
1 0 solution was rinsed three times with 70% ethanol to eliminate salts, and centrifuged for 20 

minutes at 2000 rpm. The pellet was dried at 37°C, and resuspended in 1 ml TE 1 0-1 or 1 ml 
water. The DNA concentration was evaluated by measuring the OD at 260 nm (1 unit OD - 50 
ug/ml DNA). To determine the presence of proteins in the DNA solution, the OD 260 / OD 280 
ratio was determined. Only DNA preparations having a OD 260 / OD 280 ratio between 1 .8 
1 5 and 2 were used in the subsequent examples described below. 

The pool was constituted by mixing equivalent quantities of DNA from each individual. 

Example 2 

Identification Of Biallelic Markers: Amplification Of Genomic DNA By PCR 
20 The amplification of specific genomic sequences of the DNA samples of Example 1 

was carried out on the pool of DNA obtained previously, in addition, 50 individual samples 
were similarly amplified. 

PCR assays were performed using the following protocol: 



Final volume 25 ul 

25 DNA 2ng/[il 

MgCl 2 2 mM 

dNTP(each) 200 uM 

primer (each) 2.9 ng/pl 

Ampli Taq Gold DNA polymerase 0.05 unit/jil 

30 PCR buffer (10x= 0.1 M TrisHCl pH8.3 0.5M KC1) lx 



Each pair of first primers was designed using the sequence information of genomic 
DNA sequences of SEQ ID Nos 1 to 78 disclosed herein and the OSP software (Hillier & 
Green, 1991). This first pair of primers was about 20 nucleotides in length and had the 
35 sequences disclosed in Table 4 in the columns labeled "Position range of amplification primer 
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in SEQ ID No." and "Complementary position range of amplification primer in SEQ ID "No.". 



Table 4 



Amp Ikon 


SEQ 
ID No 


Primer 
name 


Position range of 
amplification primer 
in SEQ ID 


Primer 
name 


Complementary 
position range of 
amplification primer 
in SEQ 3D 


99-27943 


1 


Bl 


7938 


7958 


CI 


8446 


8465 


99-27935 


1 


B2 


21365 


in or 

21385 


C2 


2 1 845 


21864 


8-128 


1 


B3 


65453 


65471 


C3 


65856 


65874 


99-31960 


1 


B4 


95034 


95053 


C4 


95543 


95563 


99-24656 


1 


B5 


107022 


1 ATA/) A 

107040 


C5 


1 A*7 A A C 

107495 


107513 


99-24634 


1 


B6 


160770 


160787 


C6 


161240 


161257 


99-24639 


1 


B7 


160279 


160298 


C7 


160785 


1 /A AAA 

1 60802 


99-7652 


1 


B8 


168813 


168830 


C8 


169331 


169351 


9946100 


1 


B9 


170666 


170686 


C9 


171153 


171173 


99-5862 


1 


B10 


173065 


173085 


C10 


173495 


173514 


99-5919 


1 


Bll 


189753 


189771 


CU 


190187 


190207 


99-24658 


1 


B12 


197116 


197135 


C12 


197555 


197572 


8-303 


1 


B13 


200587 


200606 


C13 


200994 


201012 


8-300 


1 


B14 


^ A'"* A A f\ 

202440 


A*** A C A 

202459 


C14 


202852 


202871 


8-299 


1 


B15 


20306 l 


203080 


C15 


203486 


203505 


8-296 


1 


B16 






C16 


2U4J4;> 


2U43o2 


99-24644 


1 


BI7 


204741 


204758 


C17 


205222 


205240 


8-252 


1 


B18 


204373 


204392 


C18 


204777 


204794 


8-295 




B19 


205034 


205052 


C19 


205435 


205453 


8-293 




B20 


205768 


205786 


C20 


206175 


206193 


8-292 




B21 


206316 


206333 


C21 


206725 


206742 


8-251 




B22 


207186 


207205 


C22 


207616 


207634 


8-289 




BI23 


208186 


208205 


C23 


208588 


208606 


8-287 




B24 


208786 


208805 


C24 


209189 


209208 


8-285 




B25 


209520 


209539 


C25 


209931 


209949 


8-283 




B26 


210219 


210237 


C26 


210620 


210638 


8-282 




B27 


210801 


210820 


C27 


211205 


211223 


8-281 




B28 


211182 


211199 


C28 


211596 


211613 
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8-279 




B29 


212290 


212309 


C29 


212698 


212716 


8-278 


I 


B30 


212662 


212679 


C30 


213091 


213109 


8-277 


I 


B31 


213783 


213803 


C31 


214062 


214082 


8-124 


{ 


B32 


214564 


214581 


C32 


214965 


214983 


8-273 


, 


B33 


215310 


215328 


C33 


215711 


215729 


8-126 




B34 


216687 


216704 


C34 


217082 


217101 


8-130 




B35 


216683 


216702 


C35 


217091 


217109 


99-5897 




B36 


219408 


219425 


C36 


219882 


219899 


99-24649 


1 


B37 


220505 


220522 


C37 


221004 


221021 


8-247 


— i — 


B38 


223011 


223028 


C38 


223549 


223566 


8-246 


— r~ 
__ — 


B39 


223962 


223978 


C39 


224588 


224605 


8-245 




B40 


224564 


224581 


C40 


225044 


225061 


8-119 


1 

— _ — 


B41 


230210 


230227 


C41 


230622 


230641 


99-15870 




B42 


231278 


231298 


C42 


231729 


231747 


99-16087 




B43 


235081 


235101 


C43 


235515 


235533 


8-155 


. 




B44 


238607 


238626 


C44 


239029 


239046 


99-16038 


. 


B45 


239405 


239425 


C45 


239862 


239880 


8-136 




B46 


239606 


239624 


C46 


240012 


240029 


99-15880 


— p— 


B47 


241700 


241717 


C47 


242151 


242171 


8-271 


— y 


B48 


243484 


243502 


C48 


243909 


243926 


8-267 


— i — 
_ — 


B49 


245105 


245123 


C49 


245518 


245535 


8-266 




B50 


245372 


245389 


C50 


245775 


245793 


8-265 




B51 


245754 


245772 


C51 


246155 


246173 


99-25940 


— i — 
— - — 


B52 


247513 


247533 


C52 


248023 


248043 


99-16032 




B53 


248204 


248223 


C53 


248588 


248606 


99-16105 




B54 


255697 


255715 


C54 


256133 


256152 


99-16033 




B55 


259885 


259902 


C55 


260324 


260342 


99-15875 




B56 


279626 


279644 


C56 


280154 


280173 


99-16052 




B57 


300153 


300170 


C57 


300660 


300680 


99*16047 




B58 


311615 


3H632 


C58 


312126 


312144 


99-25993 




B59 


315649 


315668 


C59 


316129 


316147 
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Amplicon 


SEQ 
ED No 


Primer 
name 


Position range of 
amplification primer 
in SEQ ID 


Primer 
name 


Complementary 
position range of 
amplification primer 
in SEQ ID 


8-102 


24 


B60 




ZVJ 


C60 




AO 7 


8-104 


25 


B61 




0\ 
Z. 1 


C61 






8-94 


26 


B62 




1 ft 

I 0 


C62 




A*[ ft 
4 lo 


8-95 


27 


B63 




1 ft 
I a 


C63 




41^ 


8-97 


28 


B64 




1 Q 
i o 


C64 


11 1 




8-98 


29 


B65 




1 O 


C65 


A 1 < 
41 .> 


•A 1A 

4o4 


99-14021 


30 


B66 




lo 


C66 


A£< 
40> 


47 / 


99-14364 


31 


B67 




OX 


C67 


A1A 

4,24 


A *ZA 

4j4 


99-15056 


32 


B68 




1 ft 


C68 


AQO 
4oZ 




99-15229 


33 


B69 




4.V) 


C69 


4/0 


/l QA 

4>4 


99-15232 


34 


B70 






C70 


A£7 
40 / 


4o> 


99-15253 


35 


B71 




"l O 


C71 


AZQ 


AT7 
4// 


99-15256 


36 


B72 




t Q 

lo 


C72 


Aid 
4iy 


4DO 


99-15258 


37 


B73 




i n 

19 


C73. 


oil 


C A 1 

541 


99-15261 


38 


B74 






C74 


AO 1 

481 


501 


99-15280 


39 


B75 


1 




C75 


521 


341 


99-15355 


40 


B76 


1 


1 o 
lo 


C76 


Alt 

4/1 


4oy 


99-15663 


41 


B77 


1 


' 1 o 

lo 


C77 


430 


450 


99-15664 


42 


B78 




19 


C78 


483 


CAT 

502 


99-15665 


43 


B79 






C79 


4jo 


4 /O 


99-15668 


44 


B80 




1 Q 

lo 


C80 


A 1C\ 


4^y 


99-15672 


45 


B81 




1 R 
I o 


C81 


JJJ 


S47 


99-15682 


46 


B82 




19 


C82 


482 


500 


99-16081 


47 


B83 




18 


C83 


443 


462 


99-16082 


48 


B84 




18 


C84 


512 


532 


99-20933 


49 


B85 




19 


C85 


432 


451 


99-20977 


50 


B86 




18 


C86 


491 


511 


99-20978 


51 


B87 




19 


C87 


446 


466 


99-20981 


52 


B88 




18 


C88 


429 


449 


99-20983 


53 


B89 




19 


C89 


431 


449 


99-22310 


54 


B90 




19 


C90 


446 


466 



WO 01/40493 PCT/IBOO/01507 

157 



99-25922 


55 


B91 


1 


21 


C91 


451 


471 


99-25950 


56 


B92 


1 


19 


C92 


479 


499 


99-25961 


57 


B93 


1 


20 


C93 


464 


484 


99-25965 


58 


B94 


1 


21 


C94 


451 


470 


99-25966 


59 


B95 


1 


21 


C95 


503 


523 


99-25967 


60 


B96 


1 


20 


C96 


473 


493 


99-25969 


61 


B97 


1 


21 


C97 


510 


530 


99-25972 


62 


B98 


1 


21 


C98 


428 


448 


99-25974 


63 


B99 


1 


21 


C99 


524 


544 


99-25977 


64 


B100 


1 


21 


C100 


520 


540 


99-25979 


65 


B101 


1 


19 


C101 


518 


538 


99-25985 


66 


B102 


1 


21 


CI 02 


449 


469 


99-25989 


67 


B103 


1 


19 


G103 


533 


553 


99-26146 


68 


B104 


1 


19 


C104 


436 


456 


99-26150 


69 


B105 


1 


19 


C105 


437 


454 


99-26153 


70 


B106 


1 


19 


C106 


429 


449 


99-26156 


71 


B107 


1 


IS 


C107 


491 


511 


99-27345 


72 


B108 


1 


18 


C108 


531 


551 


99-30329 


73 


BI09 


1 


18 


C109 


496 


514 


99-31798 


74 


B110 


1 


20 


C110 


491 


511 


99-5873 


75 


Bill 


11 


28 


CI 11 


466 


474 


99-6012 


76 


B112 


11 


29 


CI 12 


477 


490 


99-6080 


77 


B113 


11 


28 


C113 


519 


529 


99-7308 


78 


B114 


1 


18 


C114 


469 


489 



Preferably, the primers contained a common oligonucleotide tail upstream of the 
specific bases targeted for amplification which was useful for sequencing. 

Primers from the column labeled "Position range of amplification primer in SEQ ID 
5 No." contain the following additional PU 5' sequence: TGTAAAACGACGGCCAGT (SEQ ID 

No. 133); primers from the column labeled "Complementary position range of amplification 
primer in SEQ ID No." contain the following RP 5' sequence: CAGGAAACAGCTATGACC 
(SEQ ID No. 134). 

The synthesis of these primers was performed following the phosphoramidite method, on a 
10 GENSET UFPS 24.1 synthesizer. 
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DNA amplification was performed on a Genius II thermocycler. After heating at 95°C 
for 10 min, 40 cycles were performed. Each cycle comprised: 30 sec at 95°C, 54°C for 1 min, 
and 30 sec at 72°C. For final elongation, 10 min at 72°C ended the amplification. The 
quantities of the amplification products obtained were determined on 96-well microtiter plates, 
5 using a fluorometer and Picogreen as intercalant agent (Molecular Probes). 

Example 3 
Identification of Polymorphisms 
a) Identification of Biallelic Markers from Amplifted Genomic DNA of Example 2 
The sequencing of the amplified DNA obtained in Example 2 was carried out on ABI 
10 377 sequencers. The sequences of the amplification products were determined using automated 

dideoxy terminator sequencing reactions with a dye terminator cycle sequencing protocol. The 
products of the sequencing reactions were run on sequencing gels and the sequences were 
determined using gel image analysis (ABI Prism DNA Sequencing Analysis software (2.1.2 
version)). 

1 5 The sequence data were further evaluated to detect the presence of biallelic markers • 

within the amplified fragments. The polymorphism search was based on the presence of 
superimposed peaks in the electrophoresis pattern resulting from different bases occurring at the 
same position as described previously. 

The localization of the biallelic markers detected in the fragments of amplification are 

20 as shown below in Table 5. 



Table 5 
Biallelic Markers 



Amplicon 


BM 


Marker 
Name 


Polymor- 
phism 


SEQ 
ID No, 


BM 

position 
in SEQID 


Position of 
probes in 
SEQ ID No. 


Probe 
No. 


A1I1 1 A112 


99-27943 


Al 


99-27943-150 


G 


C 




8316 


8304 


8328 


PI 


99-27935 


A2 


99-27935-193 


G 


c 




21672 


21660 


21684 


P2 


8-128 


A3 


8-128-33 


C 


T 




65485 


65473 


65497 


P3 


99-31960 


A4 


99-31960-363 


A 


G 




95396 


95384 


95408 


P4 


99-24656 


A5 


99-24656-260 


A 


G 




107281 


107269 


107293 


P5 


99-24634 


A6 


99-24639-163 


A 


C 




160640 


160628 


160652 


P6 


99-24639 


A7 


99-24634-108 


A 


T 




160876 


160864 


160888 


P7 


99-7652 


A8 


99-7652-162 


A 


G 




168974 


168962 


168986 


P8 


99-16100 


A9 


99-16100-147 


A 


G 




170810 


170798 


170822 


P9 


99-5862 


A10 


99-5862-167 


A 


G 




173358 


173346 


173370 


P10 
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99-5919 


All 


99-5919-215 


A 


G 


1 


189957 


189945 


189969 


Pll 


99-24658 


A12 


99-24658-410 


A 


G 


1 


197163 


197151 


197175 


P12 


8-303 


Ali. 


8-303-435 


A 


G 


1 


200778 


200766 


200790 


P13 


8-300 


A14 


8-300-221 


A 


G 


1 


202651 


202639 


202663 


P14 


8-300 


A15 


8-300-193 


A 


G 


1 


202679 


202667 


202691 


P15 


8-299 


A16 


8-299-128 


A 


T 




203378 


203366 


203390 


P16 


8-296 


A17 


8-296-213 


A 


T 




204138 


204126 


204150 


P17 


8-252 


A18 


8-252-190 


C 


T 




204605 


204593 


204617 


P18 


99-24644 


A19 


99-24644-194 


A 


G 


1 


204934 


204922 


204946 


P19 


8-295 


A20 


8-295-248 


A 


C 


\ 


205206 


205194 


205218 


P20 


8-295 


A21 


8-295-125 


C 


T 




205329 


205317 


205341 


P21 


8-293 


A22 


8-293-130 


A 


G 




206064 


206052 


206076 


P22 


8-292 


A23 


8-292-398 


A 


G 




206545 


206533 


206557 


P23 


8-251 


A24 


8-251-322 


A 


G 




207313 


207301 


207325 


P24 


8-289 


A25 


8-289-322 


A 


G 




208285 


208273 


208297 


P25 


8-287 


A26 


8-287-249 


C 


T 




208960 


208948 


208972 


P26 


8-287 


A27 


8-287-86 


A 


T 


1 


209123 


209111 


209135 


P27 


8-285 


A28 


8-285-319 


A 


G 


1 


209631 


209619 


209643 


P28 


8-283 


A29 


8-283-278 


G 


C 


— I" 


210361 


210349 


210373 


P29 


8-283 


A30 


8-283-176 


A 


G 




210463 


210451 


210475 


P30 


8-283 


A31 


8-283-153 


G 


C 


1 


210486 


210474 


210498 


P31 


8-283 


A32 


8-283-56 


C 


T 


1 


210583 


210571 


210595 


P32 


8-282 


A33 . 


8-282-345 


G 


C 


1 


210879 


210867 


210891 


P33 


8-282 


A34 


8-282-260 


G 


T 


1 


210964 


210952 


210976 


P34 


8-282 


A35 


8-282-245 


A 


C 


1 


210979 


210967 


210991 


P35 


8-282 


A36 


8-282-174 


Variable 
motif 


1 


211050 






P36 


8-282 


A3 7 


8-282-92 


A 


T 




211132 


211120 


211144 


P37 


8-281 


A38 


8-281-367 


A 


G 




21 1247 


211235 


211259 


P38 


8-281 


A39 


8-281-299 


A 


G 




211315 


211303 


211327 


P39 


8-281 


A40 


8-281-248 


G 


C 




211366 


211354 


211378 


P40 


8-279 


A41 


8-279-197 


A 


C 




212520 


212508 


212532 


P41 


8-278 


A42 


8-278-289 


C 


T 




212821 


212809 


212833 


P42 


8-277 


A43 


8-277-250 


C 


T 




213833 


213821 


213845 


P43 
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8-246 
18-245 
8-119 
99-15870 



99-16087 



8-155 



99-16038 
\SA36 



99-15880 



8-271 
8-267 



99-25940 
99-16032 



99-15875 
99-16052 



99-16047 



99-25993 



Amplicon 



8-102 



160 



PCT/IBOO/01507 



1 8-124 


A A A 

A44 


J 8-273 


A45 


8-130/ 


A46- 


1 8-126 




199-5897 


A47 


99-24649 


A48 


8-247 


A49 



A54 



A55 



AS6 
A57 



A58 
A59 



A60 



A61 
A62 
A63 



A64 



8-124-294 
8-273-383 
8-130-143 / 
8-126-286 



A 

[a" 

IA 



99-5897-143 |A 
99-24649-186 A 
A 



8-247-337 



8-246-393 
8-245-270 



8-119-247 I A 
99-15870-400 A 



99-16087-219 |G 
8-155-258 IC 



99-16038-1 18 |C 
8-136-267 



C 
G~ 



G 



T 
T~ 



99-15880-162 |A 
8-271-227 |A 



G 

cf 



8-267-273 Deletion 



266-131 
■265-208 
:-265-203 



A 
IA" 



A65 
A66 
A67 



A68 
A69 



A70 



A71 



BM 



A 79 



99-25940-182 |C 



99-16032-292 |G 
99-16105-152 A 
99-16033-244 A 
99-15875-165 C 



C 
G" 
C~ 
T~ 



99-16052-214 |A 



99-16047-115 |A 



99-25993-367 |A 



Marker 
Name 



T 
~G 



G 



214857 
215347 
216967 



214845 
[215335 
216955 



219540 
220836 



219528 
1220824 



223230 [223218 



224213 
224792 



224201 
224780 



214869 
215359 
216979 



219552 
220848 



223242 



230395 
231677 



230383 
231665 



235315 
238789 



235303 
238777 



239763 1239751 



239763 
241861 
243700 



239751 



1241849 
243688 



245263 



245663 
245966 



224225 
224804 



230407 
231689 



235327 
238801 



239775 



239775 



241873 
243712 



245651 
245954 



245971 
247864" 



245959 



247852 



248315 
255848 
260099 



248303 
255836 
260087 



245675 
245978 



245983 



P44 
~P45" 



P46 



P49 



P50 



P51 



P57 



P58 
P59" 



P60 



P61 
P62" 



247876 



248327 
255860 



279789 279777 



300365 300353 



312030 312018 



316014 316002 



Polymor- 
Iphism 



SEQ I BM 
ID No. position 



260111 



279801 



300377 



312042 



316026 



Position of 
probes in 
SEQ ED No. 



P63 



P64 



P65 
P66 



P67 



P68 



P69 



P70 



P71 



Probe ] 
No. 
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8-94 


A74 


8-94-252 


A 


G 


26 


251 


930 

AJ7 


20J 


r74 


8-95 


A75 


8-95-43 


A 


G 


27 




3 1 
j l 


jj 




8-97 


A76- 


8-97-181 


A 


G 


28 


1 81 


ioy 


ly3 


r76 


8-97 


A77 


8-97-98 


C 


T 




QS 

70 


50 


1 1 A 
i 10 


P77 


8-98 


A78 


8-98-68 


A 


G 

VJ 


i7 


AC 


JO 


80 


P78 


99-14021 


A79 


99-14021-108 


A 


VJ 




1U5 


yo 


120 


P79 


99-14364 


A80 


99-14364-415 


c 


T 


3 1 


41 j 


/! AO 

4U3 


427 


P80 


99-15056 


A81 


99-15056-99 


c 


T 


39 
jZ 


OQ 

yy 


0 / 


1 1 1 


P81 


99-15229. 


A82 


99-15229-412 


A 


VJ 


33 

jj 


412 


And 
4UU 


424 


P82 


99-15232 


A83 


99-15232-291 

77 — X J J ib <ti7 1 


VJ 


T 


J4 


2y \ 


2/y 


303 


P83 


99-15253 


A84 


99-15253-382 


c 


T 


35 


joz 


3 /U 


394 


P84 


99-15256 


A85 


99-15256-302 


p 

V-r 


T 
i 


JO 


too 
3y2 


IDA 

380 


404 


P85 


99-15258 


A86 


99-15258-337 


VJ 


T 
X 


J / 


1 "2 *T 
33 / 


lot 

325 


349 


P86 


99-15261 


A87 


99-15261-20? 

77 i 1— ^vz, 


A 


vJ 


1 Q 
JO 


O AA 

200 


188 


212 


P87 


99-15280 


A88 


99-15280-432 

7 7 1 JiOv "Jt 


P 


T 
1 


jy 


/no 
432 


420 


444 


P88 


99-15355 


A89 


99-15355-1 50 




T 


4U 


1 CA 

150 


138 


162 


P89 


99-15663 


A90 


99-15663-298 


P 


T 


4 1 


298 


286 


310 


P90 


99-15664 


A91 

.<V7 I 


09- 1 -5664-1 85 

77 i JUOt-l 0 J 


p 


nr 
1 


y*"1 
42 


t o c 

185 


173 


197 


P91' 


99-15665 


A92 


99-1 5665-308 

-7-7 i JUUJ"J70 


A 


VJ 


43 


398 


386 


410 


P92 


99-15668 


A93 


99-15668-1 30 


P 


T 




139 


127 


151 


P93 


99-15672 


A94 


99-15672-1 66 


p 


T 
X 


A5 
4D 


loo 


154 


178 


P94 


99-15682 


A95 


99-15682-318 


A 


T 
X 


40 


■510 

3 lo 


306 


330 


P95 


99-16081 


A96 


99-16081-217 

77* X WO X ^— 1 / 


p 


T 
X 


4 / 


21/ 


205 


229 


P96 


99-16082 


A97 


99-16082-218 


A 


Cr 

VJ 


AH 


9 1 8 
215 


2Uo 


230 


P97 


99-20933 


A98 


99-20933-81 


A 


p 

V-> 


*+7 


91 
O 1 


69 


93 


P98 


99-20977 


A99 


99-20977-72 


A 


p 




/X 


59 


83 


P99 


99-20978 


A100 


99-20978-89 

77 J&V7 / O Q 7 


VJ 


p 

Vw 


5 i 
j l 


on 

sy 


77 


101 


PI 00 


99-20981 


A101 


99-20981-300 


A 


G 


52 


300 


288 


312 


Pirn 


99-20983 


A102 


99-20983-48 


A 


G 


53 


48 


36 


60 


P102 


99-22310 


A103 


99-22310-148 


C 


T 


54 


148 


136 


160 


P103 


99-25922 


A104 


99-25922-147 


G 


T 


55 


147 


135 


159 


PI 04 


99-25950 


A105 


99-25950-121 


G 


C 


56 


121 


109 


133 


PI 05 


99-25961 


A106 


99-25961-376 


A 


C 


57 


373 


361 


385 


P106 


99-25965 


A107 


99-25965-399 


A 


G 


58 


399 


387 


411 


PI 07 
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99-25966 


A108 


99-25966-241 


A 


G 


59 


241 


229 


253 


P108 


99-25967 


A109 


99-25967-57 


A 


G 


60 


56 


44 


68 


P109 


99-25969 


Airo 


99-25969-200 


G 


T 


61 


200 


188 


212 


PUO 


99-25972 


Ain 


99-25972-317 


C 


T 


62 


315 


303 


327 . 


Pill 


99-25974 


Al 12 


99-25974-143 


A 


G 


63 


143 


131 


155 


P112 


99-25977 


A113 


99-25977-311 


A 


G 


64 


311 


299 


323 


P113 


99-25979 


A114 


99-25979-93 


A 


G 


65 


93 


81 


105 


P114 


99-25985 


Al 15 


99-25985-194 


C 


T 


66 


194 


182 


206 


P115 


99-25989 


A116 


99-25989-398 


A 


G 


67 


398 


386 


410 


P116 


99-26146 


All 7 


99-26146-264 


G 


T 


68 


264 


252 


276 


P117 


99-26150 , 


A118 


99-26150-276 


A 


G 


69 


276 


264 


288 


PI 18 


99-26153 


A119 


99-26153-44 


A 


C 


70 


44 


32 


56 


P119 


99-26156 


A120 


99-26156-290 


A 


C 


71 


290 


278 


302 


PI 20 


99-27345 


A121 


99-27345-189 


G 


C 


72 


189 


177 


201 


P121 


99-30329 


A122 


99-30329-380 


C 


T 


73 


380 


368 


392 


P122 


99-31798 


A123 


99-31798-344 


Variable 
motif 


74 


342 






P123 


99-5873 


A124 


99-5873-159 


C 


T 


75 


159 


147 


171 


P124 


99-6012 


A125 


99-6012-220 


G 


T 


76 


220 


208 


232 


P125 


99-6080 


A126 


99-6080-99 


C 


T 


77 


99 


87 


111 


P126 


99-7308 


A127 


99-7308-157 


C 


T 


78 


156 


144 


168 


P127 



BM refers to "biallelic marker". Alll and all2 refer respectively to allele 1 and allele 2 
of the biallelic marker. 

Certain biallelic markers of the invention are insertions or deletions, as indicated above. 
5 In particular, the biallelic marker 8-282- 1 74 comprises a variable motif AAAGG or 

GAAGGAAGGAAGGAAGGAAGA, The biallelic marker 8-267-273 comprises an insertion 
or deletion of GTGGGC. The biallelic marker 99-3 1798-344 comprises a variable motif CAT 
or ACTGTCTCTACCTCA 

10 Example 4 

Validation Of The Polymorphisms Through Microsequencing 

The biallelic markers identified in Example 3 were further confirmed and their 
respective frequencies were determined through microsequencing. Microsequencing was 
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carried out for each individual DNA sample described in Example 1. 

Amplification from genomic DNA of individuals was performed by PCR as described 
above for the detection of the biallelic markers with the same set of PCR primers (Table 3). 

The preferred primers used in microsequencing were about 19 nucleotides in length and 
5 hybridized just upstream of the considered polymorphic base. According to the invention, the 

primers used in microsequencing are detailed in Table 6. 



Table 6 



iviarKer rvalue 


Marker 


ID No. 




Position range of 
tnicroseq uencing 
primer mis. 1 in 
SEQIDNo. 


Mis. 2 


Complementary 
position range of 
microsequencing 
primer mis. 2 in 
SEO ID No. 


99-27943-150 


Al 


1 


Dl 


8297 


8315 


El 


8317 


8335 


99-27935-193 


A2 


1 


D2 


21653 


21671 


E2 


21673 


21691 


8-128-33 


A3 


1 


D3 


65466 


65484 


E3 


65486 


65504 


99-31960-363 


A4 


1 


D4 


95377 


95395 


E4 


95397 


95415 


99-24656-260 


A5 


1 


D5 


107262 


107280 


E5 


107282 


107300 , 


99-24639-163 


A6 


1 


D6 


160621 


160639 


E6 


160641 


160659 


99-24634-108 


A7 


1 


D7 


160857 


160875 


E7 


160877 


160895 


99-7652-162 


AS 


1 


D8 


168955 


168973 


E8 


168975 


168993 


99-16100^147 


A9 


1 


D9 


170791 


170809 


E9 


170811 


170829 


99-5862-167 


A10 


1 


D10 


173339 , 


173357 


E10 


173359 


173377 


99-5919-215 


All 


1 


Dlt 


189938 


189956 


Ell 


189958 


189976 


99-24658-410 


A12 




D12 


197144 


197162 


E12 


197164 


197182 


8-303-235 


A13 




D13 


200759 


200777 


E13 


200779 


200797 


8-300-221 


A14 




D14 


202632 


202650 


E14 


202652 


202670 


8-300-193 


A15 




D15 


202660 


202678 


E15 


202680 


202698 


8-299-128 


A16 




D16 


203359 


203377 


El 6 


203379 


203397 


8-296-213 


A17 




D17 


204119 


204137 


E17 


204139 


204157 


8-252-190 


A18 




D18 


204586 


204604 


El 8 


204606 


204624 


99-24644-194 


A19 




D19 


204915 


204933 


E19 


204935 


204953 


8-295-248 


A20 - 




D20 


205187 


205205 


E20 


205207 


205225 


8-295-125 


A21 




D21 


205310 


205328 


E21 


205330 


205348 


8-293-130 


A22 




D22 


206045 


206063 


E22 


206065 


206083 


8-292-198 


A23 




D23 


206526 


206544 


E23 


206546 


206564 


8-251-322 


A24 




D24 


207294 


207312 


E24 


207314 


207332 
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8-289-322 


A25 


1 


D25 


208266 


208284 


E25 


208286 


208304 


8-287-249 


A26 


1 


D26 


208941 


208959 


E26 


208961 


208979 


8-287-86 


A27 


1 


D27 


209104 


209122 


E27 


209124 


209142 


8-285-319 


A28 


1 


D28 


209612 


209630 


E28 


209632 


209650 


8-283-278 


A29 


1 


D29 


210342 


210360 


E29 


210362 


210380 


8-283-176 


A30 


1 


D30 


210444 


210462 


E30 


210464 


210482 


8-283-153 


A31 


1 


D31 


210467 


210485 


E31 


210487 


210505 


8-283-56 


A32 


1 


D32 


210564 


210582 


E32 


210584 


210602 


8-282-345 


A33 


1 


D33 


210860 


210878 


E33 


210880 


210898 


8-282-260 


A34 


1 


D34 


210945 


210963 


E34 


210965 


210983 


8-282-245 


A35 


1 


D35 


210960 


210978 


E35 


210980 


210998 


8-282-174 


A36 


1 


D36 






E36 






8-282-92 


A37 


1 


D37 


211113 


211131 


E37 


211133 


211151 


8-281-367 


A38 


1 


D38 


211228 


211246 


E38 


211248 


211266 


8-281-299 


A39 


-\- 


D39 


211296 


211314 


E39 


211316 


211334 


8-281-248 


A40 




D40 


211347 


211365 


E40 


211367 


211385 


8-279-197 


A41 


1 


D41 


212501 


212519 


E41 


212521 


212539 


8-278-289 


A42 


1 


D42 


212802 


212820 


E42 


212822 


212840 


8-277-250 


A43 


1 


D43 


213814 

> 


213832 


E43 


213834 


213852 


8-124-294 


A44 


1 


D44 


214838 


214856 


E44 


214858 


214876 


8-273-383 


A45 


1 


D45 


215328 


215346 


E45 


215348 


215366 


8-130-143 / 
8-126-286 


A46 


1 


D46 


216948 


216966 


E46 


216968 


216986 


99-5897-143 


A47 


1 


D47 


219521 


219539 


E47 


219541 


219559 


99-24649-186 


A48 


1 


D48 


220817 


220835 


E48 


220837 


220855 


8-247-337 


A49 


1 


D49 


223211 


223229 


E49 


223231 


223249 


8-246-393 


A50 


1 


D50 


224194 


224212 


E50 


224214 


224232 


8-245-270 


A51 




D51 


224773 


224791 


E51 


224793 


224811 


8-119-247 


A52 




D52 


230376 


230394 


E52 


230396 


230414 


99-15870-400 


A53 




D53 


231658 


231676 


E53 


231678 


231696 


99-16087-219 


A54 




D54 


235296 


235314 


E54 


235316 


235334 


8-155-258 


A55 




D55 


238770 


238788 


E55 


238790 


238808 


99-16038-118 


A56 




D56 


239744 


239762 


E56 


239764 


239782 


8-136-267 


A57 




D57 


239744 


239762 


E57 


239764 


239782 
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A^R 






741 R47 


241860 


E58 


241862 




R-971 -797 


ASQ 

-TV J 7 


■ 


D59 


243681 


243699 


E59 


243701 


24171 9 


RJ7A7.07'* 
o-ZO f-Z fj 






D60 






F60 








A£1 
AO 1 




1 






Pfi1 

JDO 1 








A A7 










JDOZ 






0 A/T< AAA 


A 

Aoj 








z*oy /u 




zsoy /Z 






A04 






A/170/1 C 


jL<* JoOj 






1A 7QC5 
ZH/5o3 


oq i £a*2a too 


J\OJ 






74S7QA 

Z4o/yo 




GO J 


Z453 10 


*)AQ 11A 

z*fo334 


99-16105-152 


A66 


1 


D66 


255829 


255847 


E66 


255849 


255867 


99-16033-244 


A67 




D67 


260080 


260098 


E67 


260100 


260118 


99-15875-165 


A68 




D68 


279770 


279788 


E68 


279790 


279808 


99-16052-214 


A69 


! 


D69 


300346 


300364 


E69 


300366 


300384 


99-16047-115 


A70 


. i 


D70 


312011 


312029 


E70 


312031 


312049 


99-25993-367 


A71 


! 


D71 


315995 


316013 


E71 


316015 


316033 


Marker Name 


Biallelic 
Marker 


SEQ 
ID No. 


Mis. 1 


Position range of 
microsequencing 
primer mis. 1 in 
SEQ ID No. 


Mis. 2 


Complementary 
position range of 
microsequencing 
primer mis. 2 in 
SEQ ID No, 


O 1 A A A 1 } t 

8-102-231 


A *7A 

A72 


24 


U/2 


212 


230 


C7A 

E72 


232 


250 


O 1 A /* 1 AC 

8-104-105 


A/3 


2d 




86 


104 


T?AA 

E73 


106 


124 


8-94-252 


A74 


A/I 


JJ/4 


232 


250 


T?Av* 

E74 


252 


270 


8-95-43 


A75 


2f 


UfD 


24 


42 


E75 


44 


62 


8-97-181 


A A£ 

A76 


A O 

28 


D76 


162 


180 


E76 


182 


200 


o-9 7-98 


A T7 
A/ / 


AO 


U t f 


79 


97 


T2T7 

Jb// 


99 


117 


Q AO £0 


A/0 






49 


67 


Jc- /o 


69 


87 


QQ IfW 

yy-iHUz,i-iuo 


A 7Q 

a /y 




j_//y 


89 


107 


17 7G 

n/y 


109 


127 


no a aula >i 1 < 


a on 
AoU 




iJoU 


396 


414 




416 


434 


77" A JUJl}"77 


ZA.O 1 


A2 


D81 


80 


98 


F81 


100 


118 


99-15229-412 


A82 


33 


D82 


393 


411 


E82 


413 


431 


99-15232-291 


A83 


34 


D83 


272 


290 


E83 


292 


310 


99-15253-382 


A84 


35 


D84 


363 


381 


E84 


383 


401 


99-15256-392 


A85 


36 


D85 


373 


391 


E85 


393 


411 


99-15258-337 


A86 


37 


D86 


318 


336 


E86 


338 


356 


99-15261-202 


A87 


38 


D87 


181 


199 


E87 


201 


219 
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99-15280-432 


A88 


39 


D88 


411 

HI J 


411 


LjOO 


413 


Hj 1 


99-15355-150 


A89 


40 


D89 


111 


149 


F89 


151 
i j i 


1 60 

i oy 


99-1 ^663-298 


A90 


41 

*T 1 


D90 


970 


297 


JC/7V 


900 


117 


00-1 S664-1 


A91 


4? 


D91 


1 DO 


1 0*f 


P91 

XZ»7 I 


1 

1 OO 


9AA 
ZU4 


yy-i joo j-jyo 


AQ9 


41 


no? 


170 


107 

jy ( 


P09 

nyz 


1O0 

jyy 


41/ 


OO 1 <!6£R 1 10 

yy-i jooo-i J7 


A 01 


4H 


noi 

U7J 


1 9A 


i ill 

1 JO 


Jbyj 


14U 


1 Jo 


yy-i jo /z-i oo 


A 04 

Ay4 


4 J 


TiQA 

i-/y4 


1 AO 
14 / 


lOJ 


iiy4 


1 /C7 
10/ 


1 oc 
185 


QO 1 <6C9 7 1 ft 

yy- 1 jOoz-j l o 


AQC 


40 


uyj 


900 

zyy 


1 1 7 
3 1/ 


I70C 

r.yj 


no 
3 ly 


337 


on i *CA0 X 717 

yy-iouo i -zi / 


AQ£ 


AO 


FkO£ 


IAD 

iys 


ZlD 


T2C\C 

jbyo 


TIB 

z io 


236 


OO 1 £AC7 710 

yy-iouoz-zio 


/\y / 


yf 0 
45 


no7 
jjy / 


1 oo 


zl / 


tr07 

h»y / 


7 1 A 

zly 


237 


OO OrtOII 81 

yy-zuyjj-oi 


AQC 


4Q 

4y 


rioc 
L>yo 


OZ 




T7QO 


oZ 


1U0 


OO 7AQ77 79 


AQO 

/\yy 


ca 

jU 


noo 




7A 
/U 


r?oo 

fcyy 


77 


90 


OO 9AQ7R 80 


A 1 (\Ci 
A J UU 




rn ao 

1/1 UU 




Oft 
OO 


13 1 AA 


OA 

yu 


1 AO 


qo onoai -inn 
yy-zuyo 


A 1 fil 


JZ 


that 


OR 1 
Zo 1 


9QO 

zyy 


C 1 Al 


1A 1 
3U1 


7 1 A 

3 ly 


oo 7AOi!i 
yy-zuyoj-4o 


a i no 

A 1UZ 


^1 

JJ 


m A9 


90 

zy 


4 / 


c-lviZ 


/tA 

49 


67 


oo 791 i a 14R 
yy-zzj lu-iHo 


A 1 fll 


j4 


m ni 


1 90 

izy 


1 A 7 
14/ 


t?1 Al 


1 A A 

14V 
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QO 9^G79 1 yl*7 

yy-zjyzz-i4 / 


A 1 OA 


^< 
jj 


ni a a 


1 OQ 

1ZS 


140 


I? 1 A/I 

bl04 


t /in 

14s 


i 

166 


on 7cq<a 1 o 1 
yy-zjyju-izi 


A 1 A< 


JO 


JDIUj 


1 AT 
lUZ 


lZl) 


T? 1 AC 

bl05 


122 


140 


QQ 7<o/;i i*7A 
yy-zjyoi-3 /o 


A 1 A< 


<7 

J / 


Tl1 AX 


354 


372 


E106 


374 


392 


OO 7£QiCC 7QQ 

yy-zjyoj-3yy 


A 1 A7 

A1U/ 


<0 

jo 


n i A7 




398 


El 07 


400 


418 


QQ 7<Q££ OA 1 

yy-zjyoo-z4 1 


A 1 A3 

AlUo 


->y 


"Pi 1 AO 


zzz 


240 


T? 1 AO 

E108 


242 


260 


QQ 9<Q/C7 <7 

yy-zjyo/-j / 


A 1 AO 

Aiuy 


i£A 


Fll AO 

juiuy 


J / 


55 


El 09 


57 


75 


QQ 9<vQ/CQ 7AA 

yy-zjyoy-zuu 


A 1 1A 

Al l\) 


Ol 


TM 1 A 

Jul 1U 


1 ol 


199 


El 10 


201 


219 


QO 9*1079 11 7 

yy-zjy/z-oi / 


A 1 1 1 
Ai 1 1 


<7 

OZ 


Ul 1 I 


zyo 


0 1 A 

314 


El 1 1 

bin 


316 


334 


QQ-9^Q74 141 


A 1 1 0 
A 1 1Z 


£1 


FV1 1 0 
Jul 1 Z 


19A 
1Z4 


14Z 


bl 12 


1 A A 

144 


1 /C7 

loz 


QQ-9SQ77 11 1 

77 - iiJ7 //Oil 


A 1 11 

Al 1j> 


fid 
04 


Til 1 1 


909 

zyz 


j 1U 


bl 13 


7 17 

31z 


77 A 

330 


QQ_7CQ7Q 01 


A 1 1 4 
A I 1 4 


OJ 


iyl 14 


OA 
/4 


Al 

yz 


PI 1 4 

bl 14 


94 


112 


99-25985-194 


A115 


66 


D115 


175 


I 7 J 


F1 


19S 

I 7J 


211 


99-25989-398 


AI16 


67 


D116 


379 


397 


El 16 


399 


417 


99-26146-264 


A117 


68 


Din 


245 


263 


E117 


265 


283 


99-26150-276 ~ 


A118 


69 


D118 


257 


275 


E118 


277 


295 


99-26153-44 


A119 


70 


D119 


25 


43 


El 19 


45 


63 


99-26156-290 


A120 


71 


D120 


271 


289 


E120 


291 


309 


99-27345-189 


A121 


72 


D121 


170 


188 


E121 


190 


208 
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99-30329-380 


A122 


73 


D122 


361 


379 


E122 


381 


399 


99-31798-344 


A123 


74 


D123 






E123 






99-5873-159 


A124 


75 


D124 


140 


158 


E124 


160 


178 


99-6012-220 


A125 


76 


D125 


201 


219 


E125 


221 


239 


99-6080-99 


A126 


77 


D126 


80 


98 


E126 


100 


118 


99-7308-157 


A127 


78 


D127 


137 


155 


E127 


157 


175 



Mis 1 and Mis 2 respectively refer to microsequencing primers which hybridized with 
the coding strand or with the non-coding strand of the nuceotide sequences of the invention. 
The microsequencing reaction was performed as follows : 
5 After purification of the amplification products, the microsequencing reaction mixture 

was prepared by adding, in a 20 ul final volume: 10 pmol microsequencing oligonucleotide, 1 TJ 
Thermosequenase (Amersham E79000G), 1 ,25 ul Thermosequenase buffer (260 mM Tris HC1 
pH 9.5, 65 mM MgCl 2 ), and the two appropriate fluorescent ddNTPs (Perkin Elmer, Dye 
Terminator Set 401095) complementary to the nucleotides at the polymorphic site of each 
1 0 biallelic marker tested, following the manufacturer's recommendations. After 4 minutes at 

94°C, 20 PGR cycles of 1 5 sec at 55°C, 5 sec at 72°C, and 10 sec at 94°C were carried out in a 
Tetrad PTC-225 thermocycler (MJ Research). The unincorporated dye terminators were then 
removed by ethanol precipitation. Samples were finally resuspended in formamide-EDTA 
loading buffer and heated for 2 min at 95°C before being loaded on a polyacrylamide 
15 sequencing gel. The data were collected by an ABI PRISM 377 DNA sequencer and processed 

using the GENESCAN software (Perkin Elmer). 

Following gel analysis, data were automatically processed with software that allows the 
determination of the alleles of bialleLic markers present in each amplified fragment. 

The software evaluates such factors as whether the intensities of the signals resulting 
20 from the above microsequencing procedures are weak, normal, or saturated, or whether the 

signals are ambiguous. In addition, the software identifies significant peaks (according to shape 
and height criteria). Among the significant peaks, peaks corresponding to the targeted site are 
identified based on their position. When two significant peaks are detected for the same 
position, each sample is categorized classification as homozygous or heterozygous type based 
25 on the height ratio. 



Example 5a 

Association Study Between Schizophrenia And The Biallelic Markers Of The Invention 
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Collection Of DNA Samples From Affected And Non-Affected Individuals 
A^ Affected population 

AH thejampies were collected from a large epidemiological study of schizophrenia 
undertaken in hospital centers of Quebec from October 1995 to April 1997. The population was 
5 composed of French Caucasian individuals. The study design consisted in the ascertainment of 

cases and two of their first degree relatives (parents or siblings). 

As a whole, 956 schizophrenic cases were ascertained according to the following 
inclusion criteria: 

- the diagnosis had been done by a psychiatrist; 

10 - the diagnosis had been done at least 3 years before recruitment time, in order to 

exclude individuals suffering from transient manic-depressive psychosis or depressive 
disorders; 

- the patient ancestors had been living in Quebec for at least 6 generations; 

- it was possible to get a blood sample from 2 close relatives. 

1 5 Among the 956 schizophrenic ascertained cases, 834 individuals were included in the 

study for the following reasons: 

- for the included individual cases, the diagnosis of schizophrenia was established 
according to the DSM-IV (Diagnostic and Statistical Manual, Fourth edition, Revised 1994, 
American Psychiatric Press); 

20 - samples from individuals suffering from schizoaffective disorder were discarded; 

- individuals suffering from catatonic schizophrenia were also excluded from the 
population of schizophrenic cases; 

- were also excluded the individuals having a first degree relative or 2 or more second 
degree relatives suffering from depression or mood disorder; 

25 - individuals having had severe head trauma, severe obstretical complications, 

encephalitis, or meningitis before onset of symptoms were also excluded; 

- has also been excluded from the population of schizophrenic cases a patient suffering 
from epilepsy and treated with anticonvulsants. 

The age at onset was not added as an inclusion criteria. 
30 Bl Unaffected population 

Control cases were respectively ascertained based on the following cumulative criteria: 

- the individual must not be affected by schizophrenia or any other psychiatric disorder; 

- the individual must have 35 years old or more; 

-the individual must belong to the French-Canadian population; 
35 - the individual must have one or two first degree relative available for blood sampling. 
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Controls were matched with cases sex when possible. 
C) Cases and Control Populations Selected for the Association Study 
The unaffected population retained for the study was composed of 241 individuals. The 
initial sample of the clinical study was composed of 21 5 cases and 214 controls. The controls 
5 were composed of 1 16 males and 98 females while the cases were composed of 1 54 males and 

64 females. For each control, two first degree relatives (father, mother, sisters and brothers) 
were available. In order to match the sex of cases and controls, the parents of female controls 
were substituted for the female controls where possible and where the parents were known to be 
unaffected by schizophrenia or other psychosis. The parents of 27 female controls were thus 
10 substituted for the respective females, resulting in a total control sample size of 241 individuals. 

The composition of the control sample is detailed below in Table 7, 

Table 7 



Description of control samples 



Probands 


187 


Male 


116 


Female 


71 


Parents of probands 


54 


Fathers 


27 


Mothers 


27 


Total 


241 



The association data that are presented below were obtained on a population size 
15 detailed in Table 8 below, wherein the individuals have been randomly selected from the 

populations detailed above. 



Table 8 



Cases and Control Populations Selected for the Association Study 






sample type 


Cases 


Controls 


sample size 




215 


241 


Gender 










Male 


151 


143 




Female 


64 


98 


Familial history of psychosis (FH)* 








positive (FH+) 


82 


0 




none(FH-) 


133 


241 
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* : close relatives (first or second degree) 



Both case and control populations form two groups, each group consisting of unrelated 
individuals that do not share a known common ancestor. Additionally, the individuals of the 
control population were selected among those having no family history of schizophrenia or 
5 schizophrenic disorder, 

Genotyping of affected and control individuals 
A) Results from the genorypinq 

The general strategy to perform the association studies was to individually scan the 
DMA samples from all individuals in each of the populations described above in order to 
10 establish the allele frequencies of biallelic markers, and among them the biallelic markers of the 

invention, in the diploid genome of the tested individuals belonging to each of these 
populations. 

Allelic frequencies of every biallelic marker in each population (cases and controls) 
were determined by performing microsequencing reactions on amplified fragments obtained by 
15 genomic PCR performed on the DNA samples from each individual. Genomic PCR and 

microsequencing were performed as detailed above in Examples 1 to 4 using the described PCR 
and microsequencing primers. 

Single biallelic marker frequency analysis 

For each allele of the biallelic markers included in this study, the difference between the 
20 allelic frequency in the unaffected population and in the population affected by schizophrenia 

was calculated and the absolute value of the difference was determined. The more the 
difference in allelic frequency for a particular biallelic marker or a particular set of biallelic 
markers, the more probable an association between the genomic region harboring this particular 
biallelic marker or set of biallelic markers and schizophrenia. Allelic frequencies were also 
25 useful to check that the markers used. in the haplotype studies meet the Hardy-Weinberg 

proportions (random mating). 

The allelic frequencies of biallelic markers in the chromosome 1 3q3 l-q33 region 
between the affected and the unaffected population, using the sample population described 
above, is set forth in Table 9. 
30 Table 9 

Allelic frequencies of markers in different sub-samples 



marker 


alleles 


all sample 






cases 


controls 






all 


HF+ 


HF- 
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0.47 
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0.44 


0.47 


99-15232/291 


C/T 


J 0.46 


0.48 


0.43 


0.47 j 


99-14021/108 


A/G 


j 0.46 


0.48 


0.44 


0.47 


8-98/68 


A/G 


J 0.20 


0.18 


0.23 


0.19 j 


8-97/98 


C/T 


j 0.78 


0.75 


0.81 


0.80 | 


99-6012/220 


C/T | 0.20 


0.19 


0.23 


0.19 


8-95/43 


A/G J 0.18 


0.20 


0.18 


0.21 j 


99-7308/157 


C/T 


0,39 


0.42 


0.36 


039 I 


99-14364/415 


C/T 


038 


0.40 


0.36 


039 j 


99-15672/166 


C/T | 0.51 


0.47 


0.54 


0.54 I 


99-15668/139 


C/T 


0.58 


0.56 


0.62 


0.65 


99-15665/398 


A/G 


0.72 


0.67 


0.72 


0.76 


99-15663/298 


C/T 


0.72 


0.67 


0.72 


0.76 


99-15664/185 


C/T 


0.69 


0.62 


0.72 


0.72 


99-15682/318 


A/T j 0.35 


0.40 


034 


032 


99-20933/81 


A/C j 


0.43 


0.41 


0.42 


0.40 


99-16081/217 


C/T 


0.43 


0.38 


0.46 


039 


99-16082/218 


A/G 

C/T 1 


0.33 


0.31 


035 


032 


99-5862/167 

t\C\ 1 J* •* f\/\fi a mm 




0.47 


0.43 


0.44 


0.51 j 


99-16100/147 


A/G 


0.48 


0.44 


0.45 


0.50 


99-7652/162 


A/G 


0.49 


0.46 


0.46 


0.52 


99-5919/215 


A/G 


0,66 


0.71 


0.69 


0.60 


99-5897/143 


A/C 0.58 


0.61 


0.53 


0.59 


99-15870/400 


A/G j 0.32 


0.38 


0.27 


0.33 


99-16032/292 


A/C 


0.61 


0.62 


0.64 


0.58 


99-15880/162 


A/G 


0.62 


0.63 


0.65 


0.58 


99-16038/118 


A/G 


0.38 


0.36 


0.35 


0.42 1 


99-15875/165 


C/T | 0.58 


0.57 


0.57 


0.63 


99-16033/244 


C/T 


0.55 


0.57 


0.49 


0.54 


99-16047/115 


C/T 


0.73 


0.75 


0.68 


0.73 



In the association study described herein, several individual biallelic 



markers were 
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shown to be significantly associated with schizophrenia. In particular, several of the 
chromosome 13q31-q33 region biallelic markers (99-16038/1 18 (A56), 99-7652/1 62 (A8), 99- 
5919/215 (All), 99-15875/165 (A68), 99-16032/292 (A9)) showed significant association with 
schizophrenia in both familial and sporadic schizophrenia cases. The significance of the 

5 absolute value of the difference of allelic frequency of the individual biallelic markers in the 

affected and the unaffected population is set forth in 2, with several biallelic marker having 
allelic frequency differences with p-values approaching or less than 0.05, biallelic marker 99- 
5919/215 (Al 1) having a p-value of less than 0.01 . Figure 2 also shows the physical order of 
certain specific biallelic markers. These results show that several biallelic markers individually 

10 associated with schizophrenia are physically located in a particular region of significance, the 

subregion of the chromosome 13q3 l-q33 region referred to herein as Region D. 
Haplotype frequency analysis 

Analysis of markers Haplotype analysis for association of chromosome 13q3 l-q33- 
related biallelic markers and schizophrenia was performed by estimating the frequencies of all 

15 possible 2, 3 and 4 marker haplotypes in the affected and control populations described above. 

Haplotype estimations were performed by applying the Expectation-Maximization (EM) 
algorithm (Excoffier and Slatkin, 1995), using the EM-HAP LO program (Hawley et al., 1994) 
as described above. Estimated haplotype frequencies in the affected and control population 
were compared by means of a ch i-square statistical test (one degree of freedom). 

20 Haplotype association results in schizophrenia cases 

The results of the haplotype analysis using the chromosome 3 3q3 l-q33-related biallelic 
markers biallelic markers is shown in Figure 3. In particular, the figures show the most 
significant haplotypes using the biallelic markers: 99-16047/1 15 (A 10), 99-16033/244 (A67), 
99-16038/1 18 (A56), 99-15875/165 (A68), 99-16032/292 (A9), 99-5897/143 (A47), 99- 

25 15880/162 (A58), 99-16082/218 (A97), 99-5919/215 (Al 1), 99-7652/162 (A8), 99-16100/147 

(A9), 99-5862/167 (A10). 

A number of biallelic marker haplotypes were shown to be significantly associated with 
schizophrenia. A first preferred haplotype (HAJP287 of Figure 3) consisting of four biallelic 
markers (99-16038/118 (A56), 99-16082/218 (A97), (99-7652/162 (A8) and 99-16100/147 

30 (A9)) is highly significantly associated with schizophrenia in both total cases and sporadic 

cases. Figure 4 shows the characteristics of this haplotype. This haplotype presented a p-value 

7 f% 
of 3.1x10 and ail odd-ratio of 4.01 for total cases and a p-value of 3.9x10 and an odd-ratio 

of 3,88 foT sporadic cases. Phenotypic permutation tests confirmed the statistical significance 

of these results. Estimated haplotype frequencies were 13.8% in total cases, 13.5% in the 

35 sporadic cases, and 3.8 % in the controls. 
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Several other significant haplotypes are listed in Figure 3, including several 2-, 3- and 
4-marker haplotypes. Considered to be highly significantly associated with schizophrenia are 
the most significant 2-marker haplotype (MAPI consisting of biallelic markers 99-15875/165 
(A68) and 99-5919/215 (Al 1)) and the most significant 3 -marker haplotype (HAP67 consisting 
5 of biallelic markers 99-1 603 8/11 8 (A56), 99-16082/218 (A97) and 99-7652/162 (A8)). 

Further preferrred significant haplotypes considered associated with schizophrenia are 
haplotypes having p-va!ues above a desired threshold level are also; all the haplotypes listed in 
Figure 3 present p-values below 1 .0x1 0" 2 for 2-marker haplotypes, 1.0x1 0" 4 for 3 -marker 
haplotypes, and 1 ,0x1 0* 5 for 4-marker haplotypes. All of the biallelic markers presented in 

10 Figure 4 except for 1 (99-16047/1 15 (A70)) are involved in haplotypes having a p-value above 

these threshold levels. Figure 3 shows several 2-marker haplotypes, HAP1 to HAP8, having p- 
vaiues ranging from 1 .0x1 0" 2 to 1.2xl0~ 3 , several 3-marker haplotypes, HAP67 to HAP76, 
having p-values ranging from 1 .3x1 0* 5 to 1 ,0x1 0^ and several 4-marker haplotypes, HAP287 to 
HAP291, having p-values ranging from 8.2xl0~ 7 to 3.1x1 0" 7 . Figure 4 shows biallelic markers 

1 5 involved in significant haplotypes having significance thresholds of 1 .0x1 0* 2 , 1 .0x1 0* 4 , and 

1 .0x1 0" 5 for 2-, 3- and 4-marker haplotypes, respectively. 

Several 2-, 3- and 4-marker haplotypes, HAP 1, HAP8, HAP70, HAP7 1 , HAP75, 
HAP76, HAP288, HAP290 and HAP291, often comprised the biallelic marker 99-5919/21 5 
(Al 1) allele A. Furthermore, several 2-, 3- and 4-marker haplotypes, HAP7, HAP67, HAP69, 

20 HAP75, HAP287 AND HAP288, often comprised the biallelic marker 99-1 6038/1 1 8 (A56) 

allele G. 

Example 5b 

Association Study Between Schizophrenia And The Biallelic Markers Of The Invention 

25 Collection Of DNA Samples From Affected And Non- Affected Individuals 

Biallelic markers of the invention were further analyzed in the French Canadian 
population described above. For this analysis, the proband case population under study 
consisted of 139 individuals, the control population consisted of 141 individuals, as described in 
Table 10 below. 
30 Table 10 



Cases and Control Populations Selected for the Association 
Study 






Sample type 


Cases 


Controls 


Sample size 




139 


141 


Gender 










Male 


94 


96 
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Female 


45 


45 


Familial history of psychosis (FH)* 








positive (FH+) 


76 


0 




none (FH-) 


63 


141 


* : close relatives (first or second degree) 







Genotyping of affected and control individuals 
A) Results from the genotyping 

The general strategy for performing the association studies was to individually scan the 
5 D1MA samples from all individuals in each of the populations described above in order to 

establish the allele frequencies of biallelic markers, and among them the biallelic markers of the 
invention, in the diploid genome of the tested individuals belonging to each of these 
populations. 

Allelic frequencies of every biallelic marker in each population (cases and controls) 
1 0 were determined by performing microsequencing reactions on amplified fragments obtained by 

genomic PCR performed on the DNA samples from each individual. Genomic PCR and 
microsequencing were performed as detailed above in Examples 1 to 3 using the described PCR 
and microsequencing primers. 

Single biallelic marker frequency analysis 
15 For each allele of the biallelic markers included in this study, the difference between the 

allelic frequency in the unaffected population and in the population affected by schizophrenia 
was calculated and the absolute value of the difference was determined. The allelic frequencies 
of between the affected and the unaffected population in the regions is set forth in Table 1 1 , 
using the sample population described above and in Table 10. The more the difference in allelic 
20 frequency for a particular biallelic marker or a particular set of biallelic markers, the more 

probable an association between the genomic region harboring this particular biallelic marker or 
set of biallelic markers and schizophrenia. Allelic frequencies were also useful to check that the 
markers used in the haplotype studies meet the Hardy- Weinberg proportions (random mating). 

Table 11 

25 Allelic frequencies of markers in differents sub-samples (%) 



Marker 


polymorphism 


Cases 


All controls 


Ail cases 


HF+ 


HF- 


99-20978/89 


C/G 


50,37 


47,26 


54,03 


55,43 


99-20983/48 


A/G 


30,37 


28,67 


32,5 


26,52 
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10 



99-15880/162 


A/G 




65,11 


oJ,I6 


67,46 


56,43 


99-25940/182 


A/U 


59,42 


56,67 


62,7 


52,59 


99-16032/292 
99-16033/244 - 


A/C 

err 


54 51 


61,84 
jo,76 


66,67 
51,69 


55,67 

56,44 ~~ 


99-15875/165 


C/T 




57,89 


55,65 


66,3 


99-16047/115 


C/T 




74,67 


68,03 


75,19 


99-25993/367 


A/G 




40,79 


49,18 


40,51 


99-25989/398 


A/G 


%0 ft! 


33,33 


32,2 


27,86 


99-25979/93 


A/G 


oo,j z 


69,08 


66,94 


69,32 


99-25969/200 


G/T 


.50,0/ 


38,67 


34,17 


38,85 




99-25966/241 


A/G 


66,3 


67,11 


65,32 


63,21 


99-25961/376 


A/C 




42,57 


36,07 


37,31 


99-25965/399 


A/G 




51,97 


48,39 


49,64 


99-25977/311 


A/G 




67,76 


77,59 


73,72 


99-25950/121 


C/G . 




36 


26,61 


27,54 


99-25974/143 


A/G 




28,29 


22,13 


22,7 


99-26150/276 
99-15258/337 


A/G 

g/t 




51,43 
26,97 


40,83 
23,77 


47,76 


99-15261/202 


A/G 




59,46 


67,5 


65,15 


99-15256/392 


C/T 


64,96 


61,33 


69,35 


65,3 


99-15056/99 


C/T 


' 32,72 


36,49 


28,23 


31,11 


99-15280/432 


C/T 


42,28 


44 


40,16 


38,97 


99-15355/150 


C/T 


72,3 


70,39 


74,6 


68,79 


99-15253/382 


C/T 


63,04 


62,67 


63,49 


62,95 


99-5873/159 


C/T 


78,3 


79,05 


76,98 


77,34 









Haplotype frequency analysis 

Analysis of markers Haplotype analysis for association of chromosome 13q31-q33- 
related bialle.ic markers and schizophrenia was performed by estimating the frequencies of all 
po SS1 ble 2, 3 and 4 marker haplotypes in the affected and control populations described above 
Haplotype estimations were performed by app l ying the Expectation-Maximization (EM) 
algonthm (Excoffier and Slatkin, 1995), using the EM-HAPLO program (Hawley et al 1994) 
as described above. Estimated haplotype frequencies in the affected and control population 
were compared by means of a chi-square statistical test (one degree of freedom). 

Haplotype association results in ™h~ OP hreni a ca*** 

Haplotype studies yielded significant results indicating an association of the nucleotide 
sequences of the invention with schizophrenia. Significant resu,ts are shown in Figures 5 and 6, 
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including descriptions of the frequency of the haplotype leading to the maximum chi square test 
(reference no. (1 ) in figures), the test of the frequency of a particular haplotype in cases vs in 
controls (reference no. (2) in figures) and the p- value assuming that the test has a chi-square 
distribution with I degree of freedom (ddl) (reference no. (3) in figures). The results of the 
5 haplotype analysis using 28 preferred biallelic markers of the invention, 99-24656-260 (A5), 

99-24639-163 (A6), 99-24634-108 (A7), 99-7652-162 (A8), 99-16100-147 (A9), 99-5862-167 
(A10), 99-5919-215 (Al 1), 99-24658-410 (A12), 99-24644-194 (A19), 99-5897-143 (A47), 99- 
24649-186 (A48), 99-16038-118 (A56), 99-15880-162 (A58), 99-25940-182 (A64), 99-16032- 
292 (A9), 99-16033-244 (A67), 99-15875-165 (A68), 99-16047-1 15 (A70), 99-25950-121 

10 (A105), 99-25961-376 (A106), 99-25965-399 (A103), 99-25966-241 (A 108), 99-25969-200 

(AllO), 99-25974-143 (Al 12), 99-25977-311 (A113), 99-25979-93 (Al 14), 99-25989-398 
(Al 16), and 99-26150-276 (Al 1 8) are shown in Figures 5 and 6. Figures 5 and 6 also show the 
physical order of the biallelic markers comprising the haplotypes. 

Figure 5 shows the results of the haplotype analysis using the following biallelic 

1 5 markers located on the approximately 3 1 9kb sequence of SEQ ID No. 1 : 99-24656-260 ( A5), 

99-24639-163 (A6), 99-24634-108 (A7), 99-7652-162 (A8), 99-16100-147 (A9), 99-5862-167 
(A10), 99-5919-215 (All), 99-24658-410 (A12), 99-24644-194 (A19), 99-5897-143 (A47), 99- 
24649-186 (A48), 99-16038-1 18 (A56), 99-15880-162 (A58), 99-25940-182 (A64), 99-16032- 
292 (A65), 99-16033-244 (A67), 99-15875-165 (A68), and 99-16047-115 (A70). 

20 Figure 6 shows the results of the haplotype analysis using the following biallelic 

markers located on the approximately 3 19kb of SEQ ID No. las well as additional biallelic 
markers located on the human chromosome 13q31-q33 locus: 199-16038-118 (A56), 99-15880- 
162 (A58), 99-25940-182 (A64), 99-16032-292 (A65), 99-16033-244 (A67), 99-15875-165 
(A68), 99-16047-115 (A70), 99-25950-121 (Al 05), 99-25961-376 (A106), 99-25965-399 

25 (A102), 99-25966-241 (A108), 99-25969-200 (Al 10), 99-25974-143 (Al 12), 99-25977-31 1 

(Al 13), 99-25979-93 (A114), 99-25989-398 (A 11 6), and 99-26150-276 (A 11 9). 

A number of biallelic marker haplotypes were shown to be significantly associated with 
schizophrenia. 

Several preferred haplotype all showing highly significant association with 
30 schizophrenia and including various 2-, 3- and 4- marker haplotypes are haplotypes 817, 818 

and 819, 137, 138, 1 and 2 of Figure 6, and haplotypes 970, 1 54 and 1 of Figure 5. The p- 
values, odd-ratios and estimated haplotype frequencies are further described in Figures 5 and 6. 
In particular, the two marker haplotype 1 of Figure 5 consisting of biallelic markers 99-5862- 

167 (A10) and 99-15875-165 (A68) showed a highly significant p-value of 7.8xl0" 5 and an 
35 odd-ratio of 1,61. Haplotype 818 of Figure 6 consisting of four biallelic markers (99-16032-292 
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(A65), 99-25969-200 (Al 10), 99-25977-31 1 (Ai 13), and 99-25989-398 (Al 16)) presented a p- 
-7 

value of 3.1x10 and an odd-ratio of 9.08, Another example showing significance is haplotype 
817 ofFigure 6-consisting of four biallelic markers (99-16033-244 (A67), 99-15875-165 (A68), 

99-25950-121 (A105) and 99-25979-93 (Al 14)), presented a p-value of 2.4xl0~ 7 and an odd- 
5 ratio of 100. Phenotypic permutation tests confirmed the statistical significance of these results. 

Estimated haplotype frequencies were 10.5% in cases and 0 % in the controls. Haplotype 970 of 
Figure 5 consisting of four biallelic markers (99-5919-215 (All), 99-24658-410 (A12), 99- 

15875-165 (A68), and 99-16047-1 15 (A10)) presented a p-value of 7.8xl0" 7 and an odd-ratio 
of 2.41 . Phenotypic permutation tests confirmed the statistical significance of these results. 

10 Estimated haplotype frequencies were 25.7% in cases and 12.5 % in the controls. 

Several other significant hapiotypes are listed in Figures 5 and 6, including several 2-, 
3- and 4-marker hapiotypes. Considered to be highly significantly associated with 
schizophrenia are the most significant 2-marker hapiotypes (for example haplotype 1 of Figure 
5 noted above and the most significant 3-marker hapiotypes (for example haplotype 137 of 

15 Figure 6 consisting of biallelic markers (99-15875-165 (A68), 99-16047-1 15 (A70) and 99- 

25950-121 (A105)). 

Further preferrred significant hapiotypes considered associated with schizophrenia are 
hapiotypes having p-values above a desired threshold level; all the hapiotypes listed in Figures 5 
and 6 present p-values below 1 .0x1 0" 2 for 2-marker hapiotypes, 1 .OxlO" 4 for 3-marker 

20 hapiotypes, and 1 .OxlO* 5 for 4-marker hapiotypes. Figures 5 and 6 show several 2-marker 

hapiotypes, hapiotypes 1 to 9 and hapiotypes 1 to 5 of Figures 5 and 6 respectively, having p- 
values ranging from 7.8xl0~ 5 to 8.6xl0' 3 , several 3-marker hapiotypes, hapiotypes 154 to 163 
and 137 to 141 of Figures 5 and 6 respectively, having p-values ranging from 3.9x10** to 
l.ixlO* 4 and several 4-marker hapiotypes, hapiotypes 970 to 973 and 817 to 836 of Figures 5 

25 and 6 respectively, having p-values ranging from 2.4x1 0" 7 to 7,3x1 O* 6 . 

Additionally, a particularly large number of the significant 2-, 3- and 4-marker 
hapiotypes often comprised the biallelic markers A65, A12, A67, A70, A107, Al 10, Al 16 and 
most commonly A68 (99-15875-165), allele T. 

The statistical significance of the results obtained for the haplotype analysis was 

30 evaluated by a phenotypic permutation test reiterated 1 00 times on a computer. For this 

computer simulation, data from the affected and control individuals were pooled and randomly 
allocated to two groups which contained the same number of individuals as the case-control 
populations used to produce the data summarized in figures 5 and 6. A haplotype analysis was 
then run on these artificial groups for the markers included in the hapiotypes showing strong 
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association with schizophrenia. This experiment was reiterated 100 times and the results are 
shown in the columns of Figures 5 and 6 labelled "Haplotype test by permutation procedure". 
For a given haplotype, these results demonstrate the number of obtained (simulated) haplotypes 
having a p-value comparable to the one obtained for the given haplotype among 100 iterations. 
5 These results, set forth in Figures 5 and 6 validate the statistical significance of the association 

between the haplotypes and schizophrenia . 

Example 5c 

Association Study Between Schizophrenia and the Biallelic Markers of the Invention in 

French Canadian Samples 
10 Collection Of DNA Samples From Affected And Non-Affected Individuals 

BiaileHc markers of the present invention were further genotyped in French Canadian 
samples as described above in order to compare the association of the 1st and the 2nd portion of 
Region D with schizophrenia. The population used in the study was the same as described 
above with the exception that 2 male FH+ cases were not included. 

1 5 The biallelic markers analyzed in the study include 34 preferred biallelic markers of the 

invention located in Region D of the chromosome 13q31-33 region. Included in the analysis 
were the 14 following biallelic markers from the first of two portions of Region D: 99- 
26150/276 (A118), 99-26156/290 (A120), 99-26153/44 (AI19), 99-25985/194 (Al 15), 99- 
25974/143 (Al 12), 99-25977/31 1 (A114), 99-25972/317 (Al 11), 99-25965/399 (Al 07), 99- 

20 25961/376 (A106), 99-25966/241 (A108), 25967/57 (A109), 99-25969/200 (Al 10), 99- 

25979/93 (Al 14) and 99-25989/398 (Al 16). Included in the analysis were also the 20 
following biallelic markers from the second of two portions of Region D: 99-25993/367 (A71), 
99-16047/115 (A70), 99-15875/165 (A68), 99-16033/244 (A67), 99-16032/292 (A65), 99- 
25940/182 (A64), 99-15880/162 (A58), 99-16038/1 18 (A56), 99-15870/400 (A53), 99- 

25 24649/186 (A48), 99-5897/143 (A47), 99-24644/194 (A 19), 99-24658/410 (A12), 99-5919/215 

(Al 1), 99-5862/167 (A10), 99-16100/147 (A9), 99-7652/162 (A8), 99-24634/108 (A7), 99- 
24639/163 (A6) and 99-24656/260 (A5). 

Single biallelic marker association results in schizophrenia cases 

Single biallelic marker studies yielded significant results, indicating an association of 

30 the nucleotide sequences of the invention with schizophrenia. Biallelic markers used in the 

analysis included the set of 34 biallelic markers shown in Table 11 below, 14 biallelic markers 
of which were located on the first of two portions of Region D, and 20 of which were located on 
the second portion. The distribution of markers in shown in Table 12 below. As summarized in 
Table 13 , analyses using these biallelic markers demonstrated a significant association with 

35 schizophrenia for 5 markers on the second portion of Region D. 
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Table 11 




Table 12 



Region 


No. ofBiallelic 
markers (a) 


Mean frequency 
(<*) 


Mean 

inter-marker 
distance (a) 


D I st half 


14(14) 


0.34 (0.07) 


7(6.3) 


D 2 nd half 


20 (8) 


0.42 (0.06) 


11 (13) 


Dl 5C and 2 M half 


34 (22) 


0.39 (0.07) 


10.3(11) 



Haplotype frequency analysis 
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Analysis of markers Haplotype analysis for association of chromosome 13q31-q33- 
related biallelic markers and schizophrenia was performed by estimating the frequencies of all 
possible 2, 3 and 4 marker haplotypes in the affected and control populations described above. 
Haplotype estimations were performed by applying the Expectation-Maximization (EM) 
5 algorithm (Excoffier and Slatkin, 1 995), using the EM-HAPLO program (Hawley et al., 1 994) 

as described above. 

Haplotype association results in schizophrenia cases 

Significant results were also obtained in haplotype studies indicating an association of 
the nucleotide sequences of the invention with schizophrenia. 

1 0 The present inventors having previously demonstrated highly significant association of 

biallelic markers located on the Region D subregion of the human chromosome 13q31-q33 
locus with disease. Using the Omnibus LR test which compares the profile of haplotype 
frequencies, and Haplo-maxM test which is based on haplotype differences for each haplotype 
in two groups, Figures 7 and 8 describe the results of an analysis of the first and second portions 

1 5 of Region D which demonstrated an association of the second portion of Region D with 

schizophrenia. 

For combinations of 2 and 3 biallelic markers, one likelihood ratio test is obtained based 
on the haplotype frequency values calculated using the E^M algorithm, A permutation 
procedure was used, where data from the affected and control individuals was pooled and 

20 randomly allocated to two groups which contained the same number of individuals as the case- 

control populations used to produce the data. A haplotype analysis was then run on these 
artificial groups for the markers included in the haplptypes showing strong association with 
schizophrenia. This experiment was reiterated 100 times. For a given haplotype, these results 
demonstrate the number of obtained (simulated) haplotypes having a p-value comparable to the 

25 one obtained for the given haplotype among 100 iterations. 

Figure 7 shows a comparison of the LR test value distributions of haplotype 
frequencies in the two portions of Region D. This association of the second portion of Region 
D with schizophrenia is shown using both 2-marker and 3-marker combinations. The 
distribution of LR test values in the different regions was analyzed using a Kruskal-Wallis rank 

30 test, a chi-square test with r-1 degrees of freedom, where r represents the number of value sets 

compared. As shown, the significance of the association is demonstrated by a chi-square value 

(one degree of freedom) of 74.405 and a p-value of less than lxl 0" 10 for 2 marker 

combinations, and a chi-square value (one degree of freedom) of 228.72 and a p-value of lxl 0" 

* ® for 3- marker combinations. 
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Another association analysis approach based on haplotype frequency differences, 
referred to as the Haplo-maxM test, was conducted using region D biallelic markers. For one 
combination of markers having h haplotypes, h differences of haplotype frequencies can be 
compared via a Pearson chi-square statistic (one degree of freedom). The haplo-max test selects 
the difference showing the maximum positive test value between cases versus controls 
(rejecting test values based on rare haplotype frequencies, i.e, with an estimated number of 
haplotypes inferior to 10); for one combination of markers there is therefore one Max-M test 
value. The results of the Haplo-maxM test using Region D biallelic markers are shown in 
Figure 8. 

Figure 8 shows the distribution of haplo-maxM test values obtained for both 2-marker 
and 3-marker combinations in the two portions of Region D, demonstrating an association of 
the second portion of Region D with schizophrenia. The comparison of the distribution of 
Haplo-maxM test values oin the two regions was analyzed using a Kruskal-Wallis rank test, a 
chi-square test with r-1 degrees of freedom, where r represents the number of value sets 
compared. As shown, the significance of the association is demonstrated by a chi-square value * 

o 

(one degree of freedom) of 34.839 and a p-value of less than 3.58x1 0 for 2 marker 
combinations, and a chi-square value (one degree of freedom) of 13.773 and a p-value of 

2.6x10"^ for 3- marker combinations. 

The results from the haplo-maxM tests further confirms the association shown using the 
Omnibus LR test results. 

Results of association studies discussed above using biallelic markers of the invention 
are further summarized in Table 13 below, showing a significant association of the biallelic 
markers with schizophrenia in both single biallelic marker and haplotype analysis. 



Table 13 





Single-point Analysis 


Multi-point analysis 
(Haplotype-based analysis) 




No. of allelic freq 
differences > 10% 


No. Significant 
allelic tests 


Omnibus LR TEST * 








2-mks 


3-mks 


4-mks 


Region D, 1st 
portion 


0 


0 


0,03 


0,05 


0,06 


Region D, 2nd 
portion 


0 


5 


0,30 


0,30 


0,31 



* percentage of significant tests (5% level of significance) 
Cases (N=213) / Controls (N-241) 



Example 5d 

Association Study Between Bipolar Disorder and the Biallelic Markers of the Invention 
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Description of study design 

Biallelic markers of the invention were analyzed in bipolar disorder cases. As in 
examples above, single and multi-point analyses showed a significant association of the markers 
of the invention, of Region D of the chromosome 13q33 locus, and more particularly of a sub- 
5 region of Region D with bipolar disorder. 

A) Description of the Affected population 

All the samples were collected from a study of bipolar disorder undertaken in a hospital 
located south of Buenos Aires, Argentina, generally representing a population estimated at 
about 400,000 inhabitants. Patients were evaluated by four doctors in 1994 and 1995. The 

10 study design involved in the ascertainment of cases and their first degree relatives (parents or 

siblings). 514 individuals were available for the study. This group consisted of 1 58 subjects 
from 51 different families, and 356 independent subjects. 

As a whole, bipolar disorder cases were ascertained according to the diagnosis of 
bipolar disorder established by the DSM-IV (Diagnostic and Statistical Manual, Fourth edition, 

15 Revised 1994, American Psychiatric Press); 

Available for consideration for each coded case were also age, sex, nationality of 
parents and grand parents, ethnic origin, familial composition, marital state, socio-economic 
level, educational level, professional situation, employment, receational activities, age of onset 
of phychiatric symptoms, age of first consultation, occurrences of obstetric or prenatal 

20 incidents, suicide attempts, other medical conditions, treatment for or occurrence of a 

neurological condition, familial occurrence of symptoms, previous or concurrent use of 
psychotropic drugs, other admissions to a hospital or medical treatments, and diagnostic reason 
for admission including (a) DSM-IV diagnosis and (b) symptoms first presented on admission 
to hospital. 

25 Available for study were 226 bipolar disorder ascertained cases of which 203 were 

independent cases. This group consisted of 5 1 cases from 5 1 families, 20 cases in relatives 
thereof, and 155 independent cases. Upon elimination of 3 cases from the initial independent 
1 55 cases due to discovery of a familial relation, the total number of independent cases was 203. 
Cases were classified according to bipolar disorder type. The cases included 1 1 5 

30 bipolar disorder type I individuals (including 1 rapid cycling case), 67 bipolar disorder type II 

individuals (including 1 rapid cycling case), 1 8 unclassified bipolar disorder cases, and 3 cases 
which remained unclassified due to lack of or inconsistent information. 

The 203 independent cases were examined for a familial history of psychosis. 53 of 
these cases reported an occurrence of psychosis (characterized as schizophrenia or bipolar 

35 disorder) among first degree relatives (father, mother, brothers, sisters or children). 
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B) Decription of the Unaffected population 

Available for study were 201 controls which had not been affected by any psychiatric 
difficulties or reported any familial history of psychiatric difficulties. Available for 
consideration were also age, sex and ethnic origin of the unaffected population. 
5 C) Case and Control Populations Selected for the Association Study 

For the association study, the case population under study consisted of 20 1 individuals 
selected from the 226 total cases above; the control population consisted of 198 individuals 
selected from the 201 controls described above. 

The association data that are presented in the Example 5d below were obtained on a 
10 population size detailed in Table 14 below. 

Table 14 



Cases and Control Populations Selected for the Association 
Study 






Sample type 


Cases 


Controls 


Sample size 




201 


198 


Gender 










Male 


68 


SI 




Female 


124 


117 




Missing 


9 




Ethnic origin 








Causasian 


182 


177 




Non Caucasian 


5 


21 




Missing 


14 




Familial history of psychosis (FH)* 








positive (FH+) 


54 


0 




none (FH-) 


147 


198 


* : close relatives (first degree) 







Both case and control populations form two groups, each group consisting of unrelated 
individuals that do not share a known common ancestor. 
1 5 Genotvptng of affected and control individuals 

The general strategy was to individually scan the DNA samples from all individuals in 
each of the populations described above in order to establish the allele frequencies of biallelic 
markers, and among them the biallelic markers of the invention, in the diploid genome of the 
tested individuals belonging to each of these populations. 
20 Allelic frequencies of every biallelic marker in each population (cases and controls) 

were determined by performing microsequencing reactions on amplified fragments obtained by 
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genomic PCR performed on the DNA samples from each individual. Genomic PCR and 
microsequencing were performed as detailed above in Examples 1 to 4 using the described PCR 
and microsequencing primers. 

Association analysis 

5 The association analysis included 30 preferred biallelic markers of the invention located 

in Region D of the chromosome 13q3 1-33 region. Included in the analysis were the 14 
following biallelic markers from the first of tyvo subjective portions of Region D: 99-26 1 50/276 
(Al 18), 99-26156/290 (A120), 99-26153/44 (Al 19), 99-25985/194 (Al 15), 99-25974/143 
(A112), 99-25977/311 (A113), 99-25972/317 (Alll), 99-25965/399 (A107), 99-25961/376 

10 (A106), 99-25966/241 (A108), 25967/57 (A109), 99-25969/200 (A3 10), 99-25979/93 (A114) 

and 99-25989/398 (Al 16). Included in the analysis were also the 16 following biallelic markers 
from the second of two portions of Region D: 99-25993/367 (A7 1), 99-1 6047/1 1 5 (A70), 99- 
15875/165 (A68), 99-16033/244 (A67), 99-16032/292 (A66), 99-25940/182 (A65), 99- 
15880/162 (A58), 99-16038/118 (A56), 99-15870/400 (A53), 99-24649/186 (A48), 99- 

15 5897/143 (A47), 99-24644/1 94 (Al 9), 99-5919/215 (All), 99-5862/167 (A 10), 99-16100/147 

(A9), and 99-7652/162 (A8). 

A) Single biallelic marker association results in bipolar disorder cases 
For each allele of the biallelic markers included in this study, the difference between the 
allelic frequency in the unaffected population and in the population affected by bipolar disorder 

20 was calculated and the absolute value of the difference was determined. The set of biallelic 

markers and their allelic frequencies included in this study are set forth in Table 15, The more 
the difference in allelic frequency for a particular biallelic marker or a particular set of biallelic 
markers, the more probable an association between the genomic region harboring this particular 
biallelic marker or set of biallelic markers and bipolar disorder. Allelic frequencies were also 

25 useful to check that the markers used in the haplotype studies meet the Hardy- Weinberg 

proportions (under random mating assumptions) 



Table 15 



REGION 


CONTIG 


POSITION 
ON CONTIG 


SNPS GENOTYPED 


POLYMORPHISM 


FREQUENCY 
IN CONTROLS 






168,02 


99-26150/276 


A/G 


62,93 






173,29 


99-26156/290 


A/C 


72,42 






177,01 


99-26153/44 


A/C 


52,66 






186,41 


' 99-25985/194 


C/T 


28,87 






190,15 


99-25974/143 


A/G 


31,79 






216,43 


99-25977/311 


A/G 


63,82 



Region D 
first Half 
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10 







224,62 


99-25972/317 


C/T 


72,32 






236,64 


99-25965/399 


A/G 


58,24 








99-25961/376 


A/C 


44,35 






04, /U 


99-25966/241 


A/G 


66,18 






Of! Of 

257,85 


99-25967/57 


A/G 


42,44 






261,23 


99-25969/200 


G/T 


35,76 






263,67 


99-25979/93 


A/G 


67,15 






269,39 
" "299,62" " 


99-25989/398 
9T-25993/367 


A/G 
A/G 


35,88 






303,04 


99-16047/115 


C/T 


69,01 






335,02 


99-15875/165 


c/r 


61,3 






354,81 


99-16033/244 


c/r 


50,3 






366,51 


99-16032/292 


A/C 


62,87 






3.67,14 


99-25940/182 


A/G 


54,39 






372,98 


99-15880/162 


A/G 


62,72 






375,28 


99-16038/118 


A/G 


37,29 






383,41 


99-15870/400 


A/G 


29,65 






394,16 


99-24649/186 


C/T 


66,57 






395,27 


99-5897/143 




52,6 






,409,93 


99-24644/194 


A/G 


38,29 






424,95 


99-5919/215 


A/G 


60,63 






441,62 


99-5862/167 


C/T 


46,53 






444,00 


99-16100/147 


A/G 


48,84 






445,84 


99-7652/162 


A/G 


49,7 


1UTAL | 


30 







(1) : frequency (%) in Caucasian controls (N-177) of the first allele (alphabetic order) 
Region D was arbitrarily split in two halves (D 1 s * half and D 2 nd half) for purpose of the 

analysis. 

The present inventors have previously demonstrated significant association of biallelic 
markers located on the Region D subregion of the human chromosome 13q31-33 region with 
disease. Using a set of 30 biallelic markers shown in Table 15, D 1* half contained 14 markers 
and D 2 nd half contained 16 markers. 

Table 15 also shows the physical order of the biallelic markers on Region D of the 
human chromosome 13q31-q33 region. The mean intermarker distances of the biallelic markers 
on the first and the second subjective portions of Region D were as listed below in Table 16. 

Table 16 
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Region 


Mean 

Inter-marker distance (std) 


D l sl ha!f 


7.80(6.33) 


D2 Rd half 


9.79 (8.78) 


D 1 st and 2 nd half 


9.58 (8.46) 



The analysis using selected Region D biallelic markers of the invention demonstrated a 
significant association with bipolar disorder for the second portion of Region D. The analysis 
was conducted using the sample population described above with 182 Caucasian cases and 177 
5 Caucasian controls selected from the total case and control group. 

One biallelic marker in particular, 99-1 5875/1 65(A68), located on the second half of 
Region D, demonstrated a significant association with disease at a significance level of better 
than 5% (corresponding to an absolute logarithm (p-value) of 1 .3). 

B) Haplotype association results in bipolar disorder cases 
10 Haplotype analysis for association of chromosome 13q31-q33-related biallelic markersand 

bipolar disorder was performed by estimating the frequencies of all possible 2, 3 and 4 marker 
haplotypes in the affected and control populations described above. Haplotype frequencies 
estimations were performed by applying the Expectation-Maximization (EM) algorithm (Excoffier 
and Slatkin, 1995), modified by Nicholas Schork. 
15 Significant results were obtained in haplotype studies indicating an association of the 

nucleotide sequences of the invention with bipolar disorder. The haplotype analysis as shown in 
the Figures 9 A, 9B, 10A, 10B, 1 1 A and 1 IB was conducted using the sample population 
described above, using 182 Caucasian cases and 177 Caucasian controls selected from the total 
case and control group. 

20 Using the Omnibus LR test which compares the profile of haplotype frequencies, and 

Hapto-maxM test which is based on haplotype frequencies differences for each haplotype in 
two groups, Figures 9 A, 9B, 10A, 10B, 1 1 A and 1 IB show the results of a comparison of the 
first and second portions of Region D which demonstrated an association of the second portion 
of Region D with bipolar disorder. 

25 a - Omnibus LR tests values 

For a given combination of 2, 3 or 4 biallelic markers, one likelihood ratio test (LR test) 
is obtained based on the haplotype frequencies values calculated using the E-M algorithm. 

Figures 9A and 9B show a comparison of the LRtest value distributions of haplotype 
frequencies in the two portions of Region D. This association of the second portion of Region 
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D with bipolar disorder is shown using both 2-marker and 3-marker combinations. A Kruskall 
Wallis rank test was used to compare LR test values distributions in the two subjective portions 
of Region D, This test has an asymptotic Chi-square distribution, under the null hypothesis of 
no difference between the sets compared, with (r-1) degrees of freeedom, where r represents the 
5 number of sets compared. Here, we compare the 2 portions of region D, so r=2, and the 

asymptotic Chi-square distribution has 1 degree of freedom. As shown, the significance of the 
association is demonstrated by a chi-square value (one degree of freedom) of 46.62 and a p- 
-12 

value of 8.62x1 0 for 2 marker combinations, and a chi-square value (one degree of freedom) 

-29 

of 1 24,72 and a p-value of 5.86x1 0 for 3- marker combinations. 

10 b - Hapio-max tests values 

Another association analysis approach based on haplotype frequencies differences, 
referred to as the Haplo-max test, was conducted using region D biallelic markers. The haplo- 
max test selects the difference showing the maximum positive (maxM) or negative (maxS) test 
value between cases versus controls (rejecting test values based on rare haplotype frequencies, 
15 i.e, with an estimated number of haplotypes carriers inferior to 10) ; for one combination of 

markers there is therefore one Max-M and one Max-S test values. 

Figures 10A and 10B show the distribution of haplo-maxM test values obtained for both 
2-marker and 3-marker combinations in the two portions of Region D, demonstrating an 
association of the second portion of Region D with bipolar disorder. The comparison of the 
20 distribution of Haplo-maxM test values in the two regions was analyzed using a Kruskal-Waliis 

rank test, a chi-square test with 1 degree of freedom. As shown, the significance of the 

-8 

association is demonstrated by a chi-square value of 29.07 and a p-value 6.98x1 0 for 2 marker 

-23 

combinations, and a chi-square value of 98.63 and a p-value of 3.04x1 0 for 3- marker 
combinations. 

25 Figures 1 1 A and 1 1 B show the distribution of Haplo-maxS test values again obtained 

for all 2-marker and 3-marker combinations in the two portions of Region D, demonstrating an 
association of the second portion of Region D with bipolar disorder. The comparison of the 
distributions of Haplo-maxS test values in the two portions was analyzed using a Kruskal- 
Wallis rank test with one degree of freedom. As shown, the significance of the association is 

-9 

30 demonstrated by a chi-square value of 34 .6 and a p-value of 4.05x 1 0 for 2 marker 

-23 

combinations, and a chi-square value of 98.31 and a p-value of 3.58x10 for 3- marker 
combinations. 

The results from the haplo-maxM and haplo-maxS tests thus further confirm the 
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association shown using the Omnibus LR test results. 

Example 5e 

Identification of Common Haplotypes in Three Schizophrenia Populations 

5 

Based on the results above indicating an association of biallelic markers located in . 
Region D3 with schizophrenia, further biallelic markers in Region D3 were screened in three 
different populations of schizophrenia cases and controls. The D3 region is an approximately 
150kb genomic region containing the g35O30 gene. 

10 Several significant haplotypes comprised of markers in or flanking the g35030 gene and 

common to the three populations were identified. These haplotypes demonstrated significance 
in Omnibus LR and individual haplotype frequency tests. Furthermore, the estimated 
frequencies of the haplotypes were highly similar for each of the different populations. 

The samples used were obtained from a French Canadian population, an Eastern 

15 European population, and a United States sample. The French Canadian schizophrenia sample is 

described above in Example 5a, and Tables 7 and 8. The United States schizophrenia sample is 
described in Table 17 below. The Eastern European sample, is described in Table 18 below. 



Table 17 



United States Schizophrenia Cases and Control Populations 
(United States Caucasians) 






Sample type 


Cases 


Random US 
Controls 


Sample size 




131 


188 


Ethnic origin 








European Causasians 
(26 female* 66 male) 


92 






Ashkenazi Caucasians 
(7 female, 17 female) 


24 






Other Caucasians 
(7 female, 8 male) 


15 




Familial history of psychosis (FH) 








positive (FH+) 


133 






none (FH-) 


147 


198 
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Table 18 



Eastern European (Rogaev) Schizophrenia Cases and 
Control Populations 






Sample type 


Cases 


Controls 


Sample size 




183 


183 


Ethnic origin: European Causasians 
(97 female, 86 male) 






Familial history of psychosis (FH) 








positive (FH+) 


55 






none (FH-) 


128 





For each of the three populations, multipoint analyses were carried out using biallelic 
5 markers from the D3 region. The haplotypic frequency differences between case and controls 

was examined, reported as MaxM for the maximum positive difference, and MaxS as the 
maximum negative difference, and the profile of haplotype frequency differences, where the 
distribution of haplotype frequency differences is described using the Omnibus LR test. 

Several significant two-marker haplotypes were found to be in common among the 
10 three populations. One haplotype, referred to haplotype 2, is described further in Tables 1 9 and 

20. Other haplotypes are comprised of biallelic markers which are in linkage disequilibrium 
with one of the biallelic markers of haplotype 2, and thus also provide significant results. 
Biallelic markers 99-15875-165 (A68) (allele T) and 99-5862-167, involved in haplotype 2, 
demonstrated a range of estimation of haplotype frequencies of 23.6 to 24.5 in cases, and 15,5 
15 to 17.6 in controls. Again, similar differences in frequency estimates were seen with other 

haplotypes. Markers 99-15875-165 (A68) (allele T) and 99-5862-167 are located in or flanking 
the g35030 gene. 

Table 19 shows the results of analysis of estimated haplotype frequencies for haplotype 
2, comprising biallelic markers 99-15875-165 (A68) (allele T) and 99-5862-167 (A10) (allele 

20 T). Presented are the results of a likelihood ratio test (LR test), demonstrating a significant p- 

value in each of the three populations; the results of the LR test on which a permutation 
procedure has been carried out (Omnibus), demonstrating a significant p-value of 5*10-3 for 
French Canadian samples, 8*10-2 for Eastern European (Rogaev) samples, and 1*10-2 for U.S. 
samples; and the estimated haplotype frequencies in cases and controls, and their positive or 

25 negative difference (MaxM). Notably, the haplotype frequencies in the three populations are 

highly similar at 23.60, 23.90 and 24.50 for cases and 1 5.5, 17.6 and 15.5 for controls. 
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Table 19 



HAP2: 

99-15875/165 (T), 99-5862/167 (T) 


POP 


haplotype 


Pvalue- 
hap 


Omnibus 
(1000 
perm) 


freq 
cas 


freq 
controls 


Difference 


French Canadian 


TT 

LDNormalized:-0.16) 


2,20E- 
03 


5,00E-03 


23,60 


15,5 


8,10 


axM 


Eastern European 


4,60E- 
02 


8,70E-02 


23,90 


17,6 


6,3 


axM 


U.S. 


8,60E- 
03 


l,30E-02 


24,50 


15,5 


9 


axM 



Example 6 

Preparation of Antibody Compositions to the g35030 protein 

Substantially pure protein or polypeptide is isolated from transfected or transformed cells 
containing an expression vector encoding the g35030 protein or a portion thereof. The 
concentration of protein in the final preparation is adjusted, for example, by concentration on an 
Amicon filter device, to the level of a few micrograrns/rnl. Monoclonal or polyclonal antibody to 
the protein can then be prepared as follows: 

A. Monoclonal Antibody Production by Hvbridoma Fusion 

Monoclonal antibody to epitopes in the g35030 protein or a portion thereof can be prepared 
from murine hybridomas according to the classical method of Kohler, G. and Milstein, C, Nature 
256:495 (1975) or derivative methods thereof. Also see Harlow, E., and D. Lane* 1988. 
Antibodies A Laboratory Manual Cold Spring Hatbor Laboratory, pp. 53-242. 

Briefly, a mouse is repetitively inoculated with a few micrograms of the g3 5030 protein or 
a portion thereof over a period of a few weeks. The mouse is then sacrificed, and the antibody 
producing cells of the spleen isolated. The spleen cells are fused by means of polyethylene glycol 
with mouse myeloma cells, and the excess unfused cells destroyed by growth of the system on 
selective media comprising aminopterin (HAT media). The successfully fused cells are diluted and 
aliquots of the dilution placed in wells of a microtiter plate where growth of the culture is 
continued. Antibody-producing clones are identified by detection of antibody in the supernatant 
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fluid of the wells by immunoassay procedures, such as ELISA, as originally described by Engvall, 
(1980), and derivative methods thereof. Selected positive clones can be expanded and their 
monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody 
production are described in Davis, L, et al. Basic Methods in Molecular Biology Elsevier, New 
5 York. Section 21-2. 

B. Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes in the g35030 
protein or a portion thereof can be prepared by immunizing suitable non-human animal with the 
g35030 protein or a portion thereof, which can be unmodified or modified to enhance 

10 immunogenicity. A suitable non-human animal is preferably a non-human mammal is selected, 

usually a mouse, rat, rabbit, goat, or horse. Alternatively, a crude preparation which has been 
enriched for g35030 concentration can be used to generate antibodies. Such proteins, fragments 
or preparations are introduced into the non-human mammal in the presence of an appropriate 
adjuvant (e.g. aluminum hydroxide, RIBI, etc.) which is known in the art. In addition the 

15 protein, fragment or preparation can be pretreated with an agent which will increase 

antigenicity, such agents are known in the art and include, for example, methylated bovine 
serum albumin (mBSA), bovine serum albumin (BSA) f Hepatitis B surface antigen, and 
keyhole limpet hemocyanin (KLH). Serum from the immunized animal is collected, treated and ' 
tested according to known procedures. If the serum contains polyclonal antibodies to undesired 

20 epitopes, the polyclonal antibodies can be purified by imrnunoaffinity chromatography. 

Effective polyclonal antibody production is affected by many factors related both to the 
antigen and the host species. Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small 
doses (ng level) of antigen administered at multiple intradermal sites appears to be most 

25 reliable. Techniques for producing and processing polyclonal antisera are known in the art, see 

for example, Mayer and Walker (1987), An effective immunization protocol for rabbits can be 
found in Vaitukaitis, J. et al. J. Clin. Endocrinol. Metab. 33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when 
antibody titer thereof, as determined semi-quantitatively, for example, by double immunodiffusion 

30 in agar against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, 

O. et ah, (1973). Plateau concentration of antibody is usually in the range of 0.1 to 0.2 mg/ml of 
serum (about 12 |xM). Affinity of the antisera for the antigen is determined by preparing 
competitive binding curves, as described, for example, by Fisher, D., Chap. 42 in: Manual of 
Clinical Immunology, 2d Ed. (Rose and Friedman, Eds.) Amer. Soc. For Microbiol., Washington, 

35 D.C.(1980). 
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Antibody preparations prepared according to either the monoclonal or the polyclonal 
protocol are useful in quantitative immunoassays which determine concentrations of antigen- 
bearing substances in biological samples; they are also used semi-quantitatively or qualitatively to 
identify the presence of antigen in a biological sample. The antibodies may also be used in 
5 therapeutic compositions for killing cells expressing the protein or reducing the levels of the protein 

in the body. 

The disclosures of all issued patents, published PCT applications, scientific references or 
other publications cited herein are incorporated herein by reference in their entireties. 
10 Although this invention has been described in terms of certain preferred embodiments, 

other embodiments which will be apparent to those of ordinary skill in the art of view of the 
disclosure herein are also within the scope of this invention. Accordingly, the scope of the 
invention is intended to be defined only by reference to the appended claims. 



15 
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Sequence listing free text 

The following free text appears in the accompanying Sequence Listing: 

schizophrenia 

associated 

proteins 

biallelic 

markers 

regulatory 

region 

gene 

exon 

complement 
polymorphic base 
or 

allele 
deletion 
of 
- basic 
protease 
cleavage 
site 
probe 
primer 
downstream 
upstream 
amplification 
sequencing 
oligonucleotide 
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The disclosures of the following references are incorporated herein by reference in their 
entireties: 

Abbondanzo S.J. etal. (1993) Methods in Enzymology, Academic Press, Mew York. pp. 803- 
823. 

5 AjiokaR.S. etal. (1997) Am. J. Hum. Genet.60: 1439-1447. 

Aitschul et aL, 1990, J. Mol. Biol. 215(3):403-410; 
Altschul et ah, 1993, Nature Genetics 3:266-272 
Aitschul et al.,1997, "Nuc. Acids Res. 25:3389-3402 
Ames, R.S. et aL J. Immunol. Methods 184:1 77-186(1995) 
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1 . An isolated^urified or recombinant polynucleotide comprising a contiguous span of at least 
30 nucleotides selected from the group consisting of: 

5 (a) SEQ ID Nos. 2 to 1 7, and the complements thereof; 

(b) SEQ ID Nos 24, 25, 28, 74 and 79 to 1 32, and the complements thereof; and 

(c) at least one of nucleotide positions 199122 to 249803 of SEQ ID No 1. 

2. An isolated, purified or recombinant polynucleotide according to claim 1, wherein said 
10 contiguous span of SEQ ID No 1 or the complements thereof comprises at least one of the 

following nucleotide positions of SEQ ID No 1: 201 123 to 201234, 201 123 to 201560, 214676 
to 214793, 215702 to 215746, 216836 to 216994, 216836 to 217077, 217671 to 217764, 
227655 to 227736, 238715 to 238919, 240440 to 240673, 240440 to 241153, 241072 to 
241291, 244353 to 244561 and 246273 to 247802; 

3. An isolated, purified or recombinant polynucleotide according to claim 1 , wherein said span 
comprises a bialleiic marker selected from the group consisting of Al 3 to A 18, A20 to A46, 
A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 . 

20 4. A recombinant vector comprising a polynucleotide according to any one of claims 1 to 3. 

5. A host cell comprising a recombinant vector according to claim 4. 

6. A non-human host animal or mammal comprising a recombinant vector according to claim 
25 5. 

7. A mammalian host cell comprising a g35030 gene disrupted by homologous recombination 
with a knock out vector, comprising a polynucleotide according to any one of claims 1 to 3. 

30 8. A non-human host mammal comprising a g35030 gene disrupted by homologous 

recombination with a knock out vector, comprising a polynucleotide according to any one of 
claims 1 to 3. 



35 



9. Use of a polynucleotide comprising a contiguous span of at least 12 nucleotides of a sequence 
selected from the group consisting of SEQ ID Nos. 2 to 17, SEQ ID Nos 24, 25, 28, 74 and 79 
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to 132, at least one of nucleotide positions 199122 to 249803 of SEQ ID No 1, and the 
complementary sequences thereto, for determining the identity of the nucleotide at a biallelic 
marker selected from the group consisting of Al 3 to A 18, A20to A46, A49 to A52, A55, A57, 
A59 to A63, A72 to A73, A76 and A123 . 

5 

10* Use of a polynucleotide according to claim 9 in a microsequencing assay, wherein the 3' 
end of said contiguous span is located at the 3' end of said polynucleotide and wherein the 3' 
end of said polynucleotide is located 1 nucleotide upstream of a biallelic marker in said 
sequence. 

10 

11. Use of a polynucleotide according to claim 10 in a hybridization assay, wherein said span 
includes a biallelic marker. 

12. A polynucleotide according to any one of claims 1 to 3 attached to a solid support. 

15 

13. An array of polynucleotides comprising at least one polynucleotide according to claim 12. 

14. An array according to claim 13, wherein said array is addressable. 

20 15. A polynucleotide according to any one of claims 1 to 3 or 12 further comprising a label. 



16. A method of genotyping comprising determining the identity of a nucleotide at a biallelic 
marker selected from the group consisting of A13 to Al 8, A20 to A46, A49 to A52, ASS, A57, 
A59 to A63, A72 to A73, A76 and A 123 or the complement thereof in a biological sample. 

25 

17. A method according to claim 16, wherein said biological sample is derived from a single 
subject. 

18. A method according to claim 16, wherein said biological sample is derived from multiple 
30 subjects. 

19. A method according to claim 16, further comprising amplifying a portion of said sequence 
comprising the biallelic marker prior to said determining step. 



35 



20. A method according to claim 19, wherein said amplifying is performed by PGR. 
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21 . A method according to claim 16, wherein said determining is performed by a hybridization 
assay. 

5 22. A method according to claim 16, wherein said determining is performed by a sequencing 

assay. 

23. A method according to claim 16, wherein said determining is performed by a 
microsequencing assay. 

10 

24. A method according to claim 16, wherein said determining is performed by an enzyme- 
based mismatch detection assay. 

25. A method of estimating the frequency of an allele of a biallelic marker in a population 
15 comprising: 

a) genotyping individuals from said population for said biallelic marker according to the method 
of claim 16; and 

b) determining the proportional representation of said biallelic marker in said population. 

20 26. A method of detecting an association between a genotype and a trait, comprising the steps 

of: 

a) determining the frequency of at least one biallelic marker in trait positive population 
according to the method of claim 25 ; 

b) determining the frequency of at least one biallelic marker in a control population according to 
25 the method of claim 25; and 

c) determining whether a statistically significant association exists between said genotype and 
said trait. 

27. A method of estimating the frequency of a haplotype for a set of biallelic markers in a 
30 population, comprising: 

a) genotyping at least one biallelic marker according to claim 16 for each individual in 
said population; 

b) genotyping a second biallelic marker by determining the identity of the nucleotides at 
said second biallelic marker for both copies of said second biallelic marker present in the 

35 genome of each individual in said population; and 
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c) applying a haplotype determination method to the identities of the nucleotides 
determined in steps a) and b) to obtain an estimate of said frequency. 

28. A method according to claim 27, wherein said haplotype determination method is selected 
5 from the group consisting of asymmetric PCR amplification, double PCR amplification of 

specific alleles, the Clark method, or an expectation maximization algorithm. 

29. A method of detecting an association between a haplotype and a trait, comprising the steps 
of: 

10 a) estimating the frequency of at least one haplotype in a trait positive population according to 

the method of claim 27; 

b) estimating the frequency of said haplotype in a control population according to the method of 
claim 27; and 

c) determining whether a statistically significant association exists between said haplotype and 
15 said trait 



30. A method according to claim 26, wherein said genotyping of step a) is performed on each 
individual of said population. 

20 3 1 . A method according to claim 26, wherein said genotyping is performed on a single pooled 

biological sample derived from said population. 

32. A method of detecting an association between an allele and a phenotype, comprising the 
steps of: 

25 a) determining the frequency of at least one biallelic marker allele in a trait positive population 

according to the method of claim 25; 

b) determining the frequency of said biallelic marker allele in a control population according to 
the method of claim 25; and 

c) determining whether a statistically significant association exists between said allele and said 
30 phenotype. 

33. A method according to claim 26 or 29, wherein said trait is schizophrenia or bipolar 
disorder. 



35 



34. A method according to claim 26 or 29, wherein said trait is predisposition to schizophrenia 
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or bipolar disorder, an early onset of schizophrenia or bipolar disorder, or a beneficial response 
to or side effects related to treatment against schizophrenia or bipolar disorder. 

35. A method according to claim 39, wherein said phenotype is a symptom of schizophrenia or 
. 5 bipolar disorder. 

36. A method according to claim 26 or 29, wherein said control population is a trait negative 
population. 

10 37. A method according to claim 26 or 29, wherein said case control population is a random 

population. 

33. A method of determining whether an individual is at risk of schizophrenia or bipolar 
disorder, comprising: 

1 5 a) genotyping at least one biallelic marker according to the method of claim 1 6; and 

b) correlating the result of step a) with a risk of developing schizophrenia or bipolar disorder. 

39. A diagnostic kit comprising a polynucleotide according to any one of claims 1 to 3 or 12. 

20 40. A purified or isolated g35030 polypeptide which is encoded by a nucleic acid comprising a 

nucleotide sequence selected from the group consisting of SEQ ID Nos 2 to 17, and fragments 
or variants thereof. 

41. A purified or isolated g35030 polypeptide comprising at least 6 contiguous amino acid 
25 residues of any of SEQ ID Nos 1 8 to 23. 

42. A purified or isolated g35030 polypeptide according to claim 41, comprising at least one 
amino acid substitution, addition or deletion. 

30 43. A method for producing a g35030 polypeptide, wherein said method comprises the 

following steps: 

a) providing a cell host comprising a recombinant vector according to claim 4 containing a 
nucleic acid encoding a g35030 polypeptide; 

b) recovering the g3 5030 polypeptide produced by said recombinant cell host. 

35 
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44. The method according to claim 43, wherein the recombinant eel! host is a recombinant cell 
host according to claim 5. 

45. An isolated or purified antibody composition capable of selectively binding to a polypeptide 
according to claim 40 to 42. 

46. A method for specifically detecting the presence of a g35030 polypeptide in a biological 
sample, said method comprising the following steps : 

a) bringing into contact the biological sample with an antibody directed against a g35030 
polypeptide according to any one of claims 40 to 42; 

b) detecting the antigen-antibody complex formed between said antibody and said polypeptide. 

47. A diagnostic kit for detecting in vitro the presence of a g35030 polypeptide in a biological 
sample, said kit comprising: 

a) a polyclonal or monoclonal antibody directed against a g35030 polypeptide according to any one 
of claims 40 to 42 or a fragment thereof, optionally labeled; 

b) a reagent allowing the detection of the antigen-antibody complexes formed between said g35030 
polypeptide and an antibody. 

48. A method for the screening of a candidate substance, wherein said method comprises the 
following steps: 

a) providing a polypeptide according to anyone of claims 40 to 42; 

b) obtaining a candidate substance; 

c) bringing into contact said polypeptide with said candidate substance; 

d) detecting the complexes formed between said polypeptide and said candidate substance. 

49. The method of claim 48, wherein at step d), the complexes formed are incubated in the 
presence of a polyclonal or a monoclonal antibody according to claim 45 . 

50. A kit for screening a candidate substance interacting with a g35030 polypeptide, wherein 
said kit comprises: 

a) a polypeptide according to anyone of claims 40 to 42; 

b) optionally a monoclonal or a polyclonal antibody according to claim 45. 



5 1 . A method for the screening of a candidate substance, where said method comprises the 
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following steps: 

a) cultivating a prokaryotic or an eukaryotic cell that has been transfected with a nucleotide 
sequence encoding a g35030 protein or a variant or a fragment thereof, placed under the control 
of its own promoter; 
5 b) bringing into contact the cultivated cell with a molecule to be tested; 

c) quantifying the expression of the g35030 protein or a variant or a fragment thereof 

52 A computer readable medium having stored thereon a sequence selected from the group 
consisting of a nucleic acid code comprising one of the following: 
10 a) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 1 50, 

200, 500, 1000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, wherein 
said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 
1: 201123 to 247802 and 199122 to 249803; 

b) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 1 50, 
1 5 200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 1 7, and the complements 

thereof, to the extent that such a length is consistent with the particular sequence ID; 

c) a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90, 100 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, Of SEQ ID Nos. 24, 25, 28, 74 and 79 to 132, or the complements thereof; 

20 d) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 80, 

90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 
span comprises at least one of the following nucleotide positions of SEQ ID No 1 : 201 123 to 
201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 

25 240440 to 241153, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 

e) a contiguous span according to a), b), c) or d), wherein said span includes a biallelic 
marker selected from the group consisting of A 1 to A 1 27; 

f) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90 T 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 

30 wherein said span includes a polymorphism selected from the group consisting of A 1 3 to A 1 8, 

A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123 ; and 

g) a nucleotide sequence complementary to any one of the contiguous spans of a), b), c), 
d), e), f), and g). 

35 53. A computer readable medium having stored thereon a sequence consisting of a polypeptide 
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code comprising a contiguous span of at least 6 amino acids of a polypeptide sequence selected 
from the group consisting of SEQ ID Nos. 1 8 to 23. 

54. A computer system comprising a processor and a data storage device wherein said data 

5 storage device comprises a computer readable medium according to any one of claims 52 or 53. 

55. A computer system according to claim 54, further comprising a sequence comparer and a 
data storage device having reference sequences stored thereon. 

10 56. A computer system of claim 55 wherein said sequence comparer comprises a computer 

program which indicates polymorphisms. 

57. A computer system of claim 55 further comprising an identifier which identifies features in 
said sequence. 

58. A method for comparing a first sequence to a reference sequence, comprising the steps of: 

a) reading said first sequence and said reference sequence through use of a computer 
program which compares sequences; and 

b) determining differences between said first sequence and said reference sequence with said 
20 computer program, 

wherein said first sequence is selected from the group consisting of a nucleic acid code 
comprising one of the following: 

(i) a contiguous span of at least 12, 15, 18, 20, 25, 30,35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1 000 or 2000 nucleotides of SEQ ID No, 1 , and the complements thereof, wherein 

25 said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 

1: 201123 to 247802 and 199122 to 249803; 

(ii) a contiguous span of at least 12, 15, 18, 20, 25,30,35,40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 17, and the complements 
thereof, to the extent that such a length is consistent with the particular sequence ID; 

30 (iii) a contiguous span of at least 8, 12, 15, 1 8, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 

80, 90, 100 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, of SEQ ID Nos. 24, 25, 28, 74 and 79 to 1 32, or the complements thereof; 

(iv) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 

3 5 span comprises at least one of the following nucleotide positions of SEQ ID No 1 : 20 1 1 23 to 
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201234, 201 123 to 201560, 214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 238715 to 238919, 240440 to 240673, 
240440 to 241 1 53, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 

(v) a contiguous span according to (i) to (iv), wherein said span includes a biallelic marker 
5 selected from the group consisting of A 1 to A 1 27. 

(vi) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 
wherein said span includes a polymorphism selected from the group consisting of A 13 to A 18, 
A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and A123; 

10 (vii) a nucleotide sequence complementary to any one of the contiguous spans of (i) to (vi); 

and 

(viii) a polypeptide code comprising a contiguous span of at least 6 amino acids of a 
polypeptide sequence selected from the group consisting of SEQ ID Nos 18 to 23. 

15 59. A method according to claim 58, wherein said step of determining differences between the 

first sequence and the reference sequence comprises identifying at least one polymorphism. 

60. A method for identifying a feature in a sequence, comprising the steps of: 

a) reading said sequence through the use of a computer program which identifies features in 
20 sequences; and 

b) identifying features in said sequence with said computer program; 

wherein said sequence is selected from the group consisting of a nucleic acid code 
comprising one of the following; 

(i) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
25 200, 500, 1 000 or 2000 nucleotides of SEQ ID No. 1, and the complements thereof, wherein 

said contiguous span comprises at least one of the following nucleotide positions of SEQ ID No 
1 : 201 123 to 247802 and 1 99122 to 249803; 

(ii) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 150, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID Nos. 2 to 1 7, and the complements 

30 thereof, to the extent that such a length is consistent with the particular sequence ID; 

(iii) a contiguous span of at least 8, 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
80, 90, 1 00 or 200 nucleotides, to the extent that such a length is consistent with the particular 
sequence ID, of SEQ ID Nos. 24, 25, 28, 74 and 79 to 132, or the complements thereof; 

(iv) a contiguous span of at least 12, 15, 18, 20, 25, 30, 35, 40, 45, 50, 55, 60, 65, 70, 75, 
35 80, 90 or 100 nucleotides of SEQ ID No. 1 or the complements thereof wherein said contiguous 



WO 01/40493 PCT/TB00/O15O7 

211 

span comprises at least one of the following nucleotide positions of SEQ ID No 1: 201123 to 
201234, 201 123 to 201560,214676 to 214793, 215702 to 215746, 216836 to 216994, 216836 
to 217077, 217671 to 217764, 227655 to 227736, 23 8715 to 238919, 240440 to 240673, 
240440 to 241153, 241072 to 241291, 244353 to 244561 and 246273 to 247802; 
5 (v) a contiguous span according to (i) to (iv), wherein said span includes a biallelic marker 

selected from the group consisting of Al to A 127; 

(vi) a contiguous span of at least 12, 15, 1 8, 20, 25, 30, 35, 40, 50, 60, 70, 80, 90, 100, 1 50, 
200, 500, 1000 or 2000 nucleotides of any of SEQ ID No 1, and the complements thereof, 
wherein said span includes a polymorphism selected from the group consisting of A13 to A18, 

10 A20 to A46, A49 to A52, A55, A57, A59 to A63, A72 to A73, A76 and Al 23 ; 

(vii) a nucleotide sequence complementary to any one of the contiguous spans of (t) to (vi); 
and 

(viii) a polypeptide code comprising a contiguous span of at least 6 amino acids of a 
polypeptide sequence selected from the group consisting of SEQ ID Nos 18 to 23. 

15 
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SEQUENCE LISTING 



<110> GENSET 

<120> SCHIZOPHRENIA ASSOCIATED GENE, 
<130> 53.W02 



<150> 60/168,088 
<151> 1999-11-30 

<160> 134 

<170> Patent. pm 



<210> 1 
<211> 319608 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> raisc_f eature 
<222> 199122. .201122 
<223> 5 'regulatory region 

<220> 

<221> exon 

<222> 201123. .201234 
<223> exon S 

<220> 

<221> exon 

<222> 201123.-201560 
<223> exon S2 

<220> 

<221> exon 

<222> 214676. .214793 

<223> exon T 

<220> 

<221> exon 

<222> 215702. .215746 
<223> exon U 

<220> 

<221> exon 

<222> 216836. .216994 
<223> exon V 

<220> 

<221> exon 

<222> 216836. .217077 
<223> exon V2 

<220> 

<221> exon 
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<222> 217671. .217764 
<223> exon VI 



<220> 

<221> exon 

<222> 227655. .227736 

<223> exon V4 



<220> 

<221> exon 

<222> 238715. .238919 

<223> exon V3 



<220> 

<221> exon 

<222> 240440. .240673 

<223> exon W 

<220> 

<221> exon 

<222> 240440. .241153 

<223> exon W2 

<220> 

<221> exon 

<222> 241072. .241291 

<223> exon X 



<220> 

<221> exon 

<222> 244353. .244561 

<223> exon Y 

<220> 

<221> exon 

<222> 246273. .247802 

<223> exon Z 

<220> 

<221> misc_f eature 
<222> 247803. .249803 
<223> 3* regulatory region 



<220> 

<221> allele 

<222> 8316 

<223> 99-27943-150 



polymorphic base G or C 



<220> 

<221> allele 

<222> 21672 

<223> 99-27935-193 



polymorphic base G or C 



<220> 

<221> allele 
<222> 65485 

<223> 8-128-33 : polymorphic base C or T 



<220> 

<221> allele 
<222> 95396 
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<223> 99-31960-363 : polymorphic base A or G 
<ZZ0> 

<221> allele 
<222> 107281 

<223> 99-24 656-260 : polymorphic base A or G 
<220> 

<221> allele 
<222> 160640 

<223> 99-24639-163 : polymorphic base A or C 
<220> 

<221> allele 
<222> 160876 

<223> 99-24 634-108 : polymorphic base A or T 
<220> 

<221> allele 
<222> 168974 

<223> 99-7652-1.62 : polymorphic base A or G 
<220> 

<22I> allele 
<222> 170810 

<223> 99-16100-147 : polymorphic base A or G 
<220> 

<221> allele 
<222> 173358 

<223> 99-5862-167 : polymorphic base A or G 
<220> 

<221> allele 
<222> 189957 

<223> 99-5919-215 : polymorphic base A or G 
<220> 

<221> allele 
<222> 197163 

<223> 99-24 658-410 : polymorphic base A or G 
<220> 

<221> allele 
<222> 200778 

<223> 8-303-235 : polymorphic base A or G 
<220> 

<221> allele 
<222> 202651 

<223> 8-30O-221 : polymorphic base A or G 
<220> 

<221> allele 
<222> 202679 

<223> 8-300-193 : polymorphic base A or G 
<220> 

<221> allele 
<222> 203378 

<223> 8-299-128 : polymorphic base A or T 
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<220> 

<22l> allele 
<222> 204138 

<223> 8-2 96-213 : polymorphic base A or T 
<220> 

<221> allele 
<222> 204605 

<223> 8-252-190 : polymorphic base C or T 
<220> 

<221> allele 
<222> 204934 

<223> 99-24644-194 : polymorphic base A or G 
<220> 

<221> allele 
<222> 205206 

<223> 8-295-248 : polymorphic base A or C 
<220> 

<221> allele 
<222> 205329 

<223> 8-295-125 : polymorphic base C or T 
<220> 

<221> allele 
<222> 206064 

<223> 8-293-130 : polymorphic base A or G 
<220> 

<221> allele 
<222> 206545 

<223> 8-292-198 : polymorphic base A or G 
<220> 

<221> allele 
<222> 207313 

<223> 8-251-322 : polymorphic base A or G 
<220> 

<221> allele 
<222> 208285 

<223> 8-28 9-322 : polymorphic base A or G 
<220> 

<221> allele 
<222> 208960 

<223> 8-287-249 : polymorphic base C or T 
<220> 

<221> allele 
<222> 209123 

<223> 8-287-86 : polymorphic base A or T 
<220> 

<221> allele 
<222> 209631 

<223> 8-285-319 : polymorphic base A or G 
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<220> 

<221> allele 
<222> 210361 
<223> 8-283-278 

<220> 

<221> allele 
<222> 210463 
<223> 8-283-176 



polymorphic base G or C 



polymorphic base A or G 



<220> 

<221> allele 
<222> 210486 
<223> 8-283-153 

<220> 

<221> allele 
<222> 210583 
<223> 8-283-56 



polymorphic base G or C 



polymorphic base C or T 



<220> 

<221> allele 
<222> 210879 
<223> 8-282-345 



polymorphic base G or C 



<220> 

<221> allele 
<222> 210964 
<223> 8-282-260 



polymorphic base G or T 



<220> 

<221> allele 
<222> 210979 
<223> 8-282-245 



polymorphic base A or C 



<220> 

<221> allele 
<222> 211050 
<223> 8-282-174 



variable motif AAAGG or GAAGGAAGGAAGGAAGGAAGA 



<220> 

<221> allele 
<222> 211132 
<223> 8-282-92 

<220> 

<221> allele 
<222> 211247 
<223> 8-281-367 

<220> 

<221> allele 
<222> 211315 
<223> 8-281-299 



polymorphic base A or T 



polymorphic base A or G 



polymorphic base A or G 



<220> 

<221> allele 
<222> 211366 
<223> 8-281-248 



polymorphic base G or C 



<220> 
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<221> allele 
<222> 212520 

<223> 8-279-197 : polymorphic base A or C 
<220> 

<221> allele 
<222> 212821 

<223> 8-278-289 : polymorphic base C or T 
<220> 

<221> allele 
<222> 213833 

<223> 8-277-250 : polymorphic base C or T 
<220> 

<221> allele 
<222> 214857 

<223> 8-124-294 : polymorphic base A or G 
<220> 

<221> allele 
<222> 215347 

<223> 8-273-383 : polymorphic base A or G 
<220> 

<221> allele 
<222> 216967 

<223> 8-126-286 or 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 

<222> 219540 ^ 

<223> 99-5897-143 : polymorphic base A or C 
<220> 

<221> allele 
<222> 220836 

<223> 99-24649-186 : polymorphic base A or G 
<220> 

<221> allele 
<222> 223230 

<223> 8-247-337 : polymorphic base A or G 
<220> 

<221> allele 
<222> 224213 

<223> 8-24 6-393 : polymorphic base A or T 
<220> 

<221> allele 
<:222> 224792 

<223> 8-245-270 : polymorphic base A or G 
<220> 

<221> allele 
<222> 230395 

<223> 8-119-247 : polymorphic base A or G 
<220> 

<221> allele 
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<222> 231677 

<223> 99-15870-400 : polymorphic base A or G 
<220> 

<221> allele 
<222> 235315 

<223> 99-16087-219 ^polymorphic base G or C 
<220> 

<221> allele 
<222> 238789 

<223> 8-155-258 : polymorphic base C or T 
<220> 

<221> allele 
<222> 239763 

<223> 8-136-267 or 99-16038-118 ; polymorphic base C or T 
<220> 

<221> allele 
<222> 241861 

<223> 99-15880-162 : polymorphic base A or G 
<220> 

<221> allele 
<222> 243700 

<223> 8-271-227 : polymorphic base A or G 
<220> 

<221> allele 
<222> 245264 

<223> 8-267-273 : deletion of GTGGGC 
<220> 

<221> allele 
<222> 245663 

<223> 8-266-131 : polymorphic base A or C 
<220> 

<221> allele 
<222> 245966 

<223> 8-265-208 : polymorphic base A or G 
<220> 

<221> allele 
<222> 245971 

<223> 8-265-203 : polymorphic base G or C 
<220> 

<221> allele 
<222> 247864 

<223> 99-25940-182 : polymorphic base C or T 
<220> 

<221> allele 
<222> 248315 

<223> 99-16032-2 92 : polymorphic base G or T 
<220> 

<221> allele 
<222> 255848 
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<223> 99-16105-152 : 
<220> 

<221> allele 
<222> 260099 
<223> 99-16033-244 : 

<220> 

<221> allele 
<222> 279789 
<223> 99-15875-165 : 

<220> 

<221> allele 
<222> 300365 
<223> 99-16052-214 : 

<220> 

<221> allele 
<222> 312030 
<223> 99-16047-115 : 

<220> 

<221> allele 
<222> 316014 
<223> 99-25993-367 : 



polymorphic base A or G 



polymorphic base A or G 



polymorphic base C or T 



polymorphic base A or G 



polymorphic base A or G 



polymorphic base A or G 



<220> 

<221> primerjbind 
<222> 7938. .7958 
<223> 99-27943. rp 

<2 20> 

<2 21> primerjbind 

<222> 8446. .8465 

<223> 99-27943. pu complement 

<220> 

<221> primer_bind 
<222> 21365.. 21385 
<223> 99-27935. rp 

<220> 

<221> primerjbind 

<222> 21845. .21864 

<223> 99-27935. pu complement 

<220> 

<221> primer_bind 
<222> 65453. .65471 
<223> 8-128. pu 

<220> 

<221> primerjbind 

<222> 658S6..65874 

<223> 8-128.rp complement 

<220> 

<221> primerjbind 
<222> 95034. .95053 
<223> 99-31960. pu 
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<220> 

<221> primer__bind 

<222> 95543. .95563 

<223> 99-31960, rp complement 

<220> 

<221> primer_bind 
<222> 107022. .107040 
<223> 99-24656. pu 

<220> 

<221> primer_bind 

<222> 107495. .107513 

<223> 99-24656. rp complement 

<220> 

<221> primer__bind 
<222> 160279. .160298 
<223> 99-24639. rp 

<220> 

<221> primer_bind 
<222> 160770. .160787 
<223> 99-24634. pu 

<220> 

<221> primer_bind 

<222> 160785. .160802 

<223> 99-24639.pu complement 

<220> 

<221> primer_bind 

<222> 161240. .161257 

<223> 99-24 634. rp complement 

<220> 

<221> prime r_bind 
<222> 168813. .168830 
<223> 99-7652. pu 

<220> 

<221> primerjoind 

<222> 169331. .169351 

<223> 99-7652. rp complement 

<220> 

<221> primer_bind 
<222> 170666.,. 170686 
<223> 99-16100. pu 

<220> 

<221> primer_bind 

<222> 171153. .171173 

<223> 99-16100. rp complement 

<220> 

<221> primer_bind 
<222> 173065. .173085 
<223> 99-5862. rp 
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<220> 

<221> primer_bind 

<222> 173495. .173514 

<223> 99-5862, pu complement 

<220> 

<221> primer bind 
<222> 189753. .189771 
<223> 99-5919. pu 

<220> 

<221> primer_bind 

<222> 190187. .190207 

<223> 99-5919. rp complement 

<220> 

<221> primer__bind 

<222> 197116. .197135 

<223> 99-24658. rp 

<220> 

<221> primer_bind 

<222> 197555. .197572 

<223> 99-24 658. pu complement 

<220> 

<221> primer__bind 
<222> 200587 . .200606 
<223> 8-303. rp 

<220> 

<221> primer_bind 
<222> 200994 .201012 
<223> 8-303. pu complement 

<220> 

<221> prime rjbi.nd 
<222> 202440. .202459 
<223> 8-300. rp 

<220> 

<221> prime r_bind 
<222> 202852. .202871 
<223> 8-300. pu complement 

<220> 

<221> prime r^bind 
<222> 203061. .203080 
<223> 8-299. rp 

<220> 

<221> primerjoind 
<222> 203486. .203505 
<223> 8-299. pu complement 

<220> 

<221> primer_bind 
<222> 203943. .203962 
<223> 8-296. rp 

<220> 
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<221> primer_bind 
<222> 204345. .204362 
<223> 8-296. pu complement 

<220> 

<221> primer__bind 
<222> 204373. .204392 - 
<223> 8-252. rp 

<220> 

<221> primer__bind 
<222> 204741. .204758 
<223> 99-24644. pu 

<220> 

<221> primerjoind 
<222> 204777 . .204794 
<223> 8-252. pu complement 

<220> 

<221> prime r_bind 
<222> 205034 . .205052 
<223> 8-295. rp 

<220> 

<221> primer_bind 

<222> 205222.. 205240 

<223> 99-24644. rp complement 

<220> 

<221> primerjbind 
<222> 205435. .205453 
<223> 8-295. pu complement 

<220> 

<221> primerjbind 
<222> 205768. .205786 
<223> 8-293. rp 

<220> 

<221> primer_bind 
<222> 206175. .206193 
<223> 8-293. pu complement 

<220> 

<221> primer_bind 
<222> 206316. .206333 
<223> 8-292. rp 

<220> 

<221> primer_bind 
<222> 206725. .206742 
<223> 8-292. pu complement 

<220> 

<221> primer_bind 
<222> 207186. .207205 
<223> 8-251. rp 

<220> 

<221> primer_bind 
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<222> 207616. .207634 
<223> 8-251. pu complement 

<220> 

<221> prime r_bind 
<222> 208186. .208205 
<223> 8-289. rp 

<220> 

<221> primer_bind 
<222> 208588. .208606 
<223> 8-289. pu complement 

<220> 

<221> primer_bind 
<222> 208786. .208805 
<223> 8-287. rp 

<220> 

<221> primer_bind 
<222> 209189. .209208 
<223> 8-287, pu complement 

<220> 

<221> primer_bind 
<222> 209520. .209539 
<223> 8-285. rp 

<220> 

<221> primer_bind 
<222> 209931. .209949 
<223> 8-285. pu complement 

<220> 

<221> prime r_bind 

<222> 210219. .210237 

<223> 8-283. rp 

<220> 

<221> primer_bind 
<222> 210620. .210638 
<223> 8-283. pu complement 

<220> 

<221> primer_bind 
<222> 210801. .210820 
<223> 8-282. rp 

<220> 

<221> primer_bind 
<222> 211182. .211199 
<223> 8-281. rp 

<220> 

<221> primer Joind 

<222> 211205. .211223 

<223> 8-282 .pu complement 

.<220> 

<221> primer bind 
<222> 2115967.211613 
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<223> 8-281. pu complement 
<220> 

<221> primerjaind 
<222> 212290. .212309 
<223> 8-279. rp 

<220> 

<221> primer_bind 

<222> 212662. .212679 

<223> 8-278. rp 

<220> 

<221> primer_bind 
<222> 212698. .212716 
<223> 8-279. pu complement 

<220> 

<221> primer__bind 

<222> 213091. .213109 

<223> 8-278 -pu complement 

<220> 

<221> primer_bind 

<222> 213783. .213803 

<223> 8-277. rp 

<220> 

<221> primer_bind 

<222> 214062. .214082 

<223> 8-277. pu complement 

<220> 

<221> primer_bind 
<222> 214564 . .214581 
<223> 8-124. pu 

<220> 

<221> primer_bind 
<222> 214965. .214983 
<223> 8-124. rp complement 

<220> 

<221> primer_bind 
<222> 215310. .215328 
<223> 8-273. rp 

<220> 

<221> primer_bind 
<222> 215711. .215729 
<223> 8-273. pu complement 

<220> 

• <221> primerjaind 
<222> 216683. .216702 
<223> 8-130, rp 

<220> 

<221> primer_bind 

<222> 216687. .216704 

<223> 8-126. pu 
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<220> 

<221> primer — bind 
<222> 217082. .217101 
<223> 8-126. rp complement 

<220> 

<221> primer^bind 
<222> 217091. .217109 
<223> 8-130, pu complement 

<220> 

<221> primer_bind 
<222> 219408. .219425 
<223> 99-5897. pu 

<220> 

<221> primer_bind 

<222> 219882. .2198S9 

<223> 99-5897. rp complement 

<220> 

<221> primer__bind 
<222> 220505. .220522 
<223> 99-24649. rp 

<220> 

<221> primer_bind 

<222> 221004 . .221021 

<223> 99-24649. pu complement 

<220> 

<221> primer_bind 
<222> 223011. .223028 
<223> 8-247. rp 

<220> 

<221> priiner__bind 
<222> 223549. .223566 
<223> 8-247. pu complement 

<220> 

<221> primer__bind 
<222> 223962. .223978 
<223> 8-246. rp 

<220> 

<221> primerjDind 
<222> 224564. .224581 
<223> 8-245. rp 

<220> 

<221> prirnerjoind 
<222> 224588. .224605 
<223> 8-246. pu complement 

<220> 

<221> primer_bind 
<222> 225044. .225061 
<223> 8-245. pu complement 
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<220> 

<221> primer_bind 
<222> 230210. .230227 
<223> 8-119. rp 

<220> 

<221> primer_bind 
<222> 230622. .230641 
<223> 8-119. pu complement 

<220> 

<221> primer_bind 
<222> 231278 . .231298 
<223> 99-15870. pu 

<220> 

<221> primer__bind 

<222> 231729. .231747 

<223> 99-15870. rp complement 

<220> 

<2 21> primer_bind 
<222> 235081. .235101 
<223> 99-16087. rp 

<220> 

<221> primer_bind 

<222> 235515. .235533 

<223> 99-16087. pu complement 

<220> 

<221> primer_bind 
<222> 238607. .238626 
<223> 8-155. rp 

<220> 

<221> prxmer_bind 
<222> 239029. .239046 
<223> 8-155. pu complement 

<220> 

<221> primer Jbind 
<222> 239405. .239425 
<223> 99-16038. rp 

<220> 

<221> primer_bind 
<222> 239606. .239624 
<223> 8-136. rp 

<220> 

<221> primer_bind 

<222> 239862. .239880 

<223> 99-16038. pu complement 

<220> 

<221> primerjDind 
<222> 240012. .240029 
<223> 8-136. pu complement 

<220> 
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<221> primer_bind 
<222> 241700. .241717 
<223> 99-15380. pu 

<220> 

<221> primer_bind 

<222> 242151. .242171 ~ ' 

<223> 99-15880. rp complement 

<220> 

<221> prirner_bind 
<222> 243484. .243502 
<223> 8-271. rp 

<220> 

<221> prime rjaind 
<222> 243909* .243926 
<223> 8-271 .pu complement 

<220> 

<221> primer_bind 
<222> 245105. .245123 
<223> 8-267. rp 

<220> 

<221> primer__bind 
<222> 245372. .245389 
<223> 8-266. rp 

<220> 

<221> primer_bind 
<222> 245518. .245535 
<223> 8-267.pu complement 

<220> 

<221> primer_bind 
<222>- 245754. .245772 
<223> 8-265, rp 

<220> 

<221> primer_bind 
<222> 245775. .245793 
<223> 8-2 66.pu complement 

<220> 

<221> primer_bind 
<222> 246155. .246173 
<223> 8-265. pa complement 

<220> 

<221> prime r_bind 
<222> 247513. .247533 
<223> 99-25940. rp 

<220> 

<221> primer_bind 

<222> 248023. .248043 

<223> 99-25940. pu complement 

<220> 

<221> primer_bind 
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<222> 248204 . .248223 
<223> 99-16032. rp 

<220> 

<221> primer_bind 

<222> 248588. .248606 

<223> 99-1 6032. pu comp-lement 

<220> 

<221> primer_bind 
<222> 255697. .255715 
<223> 99-16105. pu 

<220> 

<221> primer_bind 

<222> 256133. .256152 

<223> 99-16105. rp complement 

<220> 

<221> primer_bind 
<222> 259885. .259902 
<223> 99-16033. rp 

<220> 

<221> primer_bind 

<222> 260324. .260342 

<223> 99-1 6033. pu complement 

<220> 

<221> prime rjoind 
<222> 279626. .279644 
<223> 99-15875. pu 

<220> 

<221> primer_bind 

<222> 280154, .280173 

<223> 99-15875. rp complement 

<220> 

<221> primer_bind 
<222> 300153. .300170 
<223> 99-16052. pu 

<220> 

<221> primer_bind 

<222> 300660. .300680 

<223> 99-1 6052. rp complement 

<220> 

<221> primer_bind 
<222> 311615. .311632 
<223> 99-16047. rp 

<220> 

<221> primer_bind 

<222> 31212.6. .312144 

<223> 99-16047. pu complement 

<220> 

<221> primer_bind 
<222> 315649. .315668 
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<223> 



99-25993. pu 



<220> 
<221> 
<222> 
<223> 



primer_bind 
316129. .316147 
99-25993. rp compl emen t 



<220> 

<221> primerjoind 
<222> 8297. .8315 
<223> 99-27943-150. mis 

<220> 

<221> prime r_Jbind 
<222> 8317, .8335 

<223> 99-27 943-150. mis complement 
<220> 

<221> primerjbind 
<222> 21653. .21671 
<223> 99-27935-193. mis 

<220> 

<221> primer_bind 

<222> 21673. .21691 

<223> 99-27935-193. mis complement 

<220> 

<221> primer__bind 

<222> 65466. .65484 

<223> 8-128-33. mis 

<220> 

<221> primerjoind 

<222> 65486. . 65504 

<223> 8-128-33. mis complement 

<220> 

<221> primer_ bind 
<222> 95377. .95395 
<223> 99-31960-363. mis 

<220> 

<221> primerjoind 

<222> 95397. .95415 

<223> 99-31960-363. mis complement 

<220> 

.<221> primer__bind 
<222> 107262. .107280 
<223> 99-24656-260. mis 

<220> 

<221> prime r_bind 

<222> 107282. .107300 

<223> 99-24 656-260. mis complement 

<220> 

<221> primer__bind 
<222> 160621. . 160639 
<223> 99-24639-163. mis 
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<220> 

<221> prime r_bind 

<222> 160641. .160659 

<223> 99-24 639-163 .mis complement 

<220> 

<221> primer Joind 
<222> 160857. .160875 
<223> 99-24634-108. mis 

<220> 

<221> primer_bind 

<222> 160877. , 160895 

<223> 99-24634-108 .mis complement 

<220> 

<221> prime r_bind 

<222> 168955. .168973 

<223> 99-7652-162. mis 

<220> 

<221> primer Jbind 

<222> 168975. .168993 

<223> 99-7652-162. mis complement 

<220> 

<221> primer__bind 
<222> 170791. . 170809 
<223> 99-16100-147 .mis 

<220> 

<221> primer__bind 

<222> 170811. .170829 

<223> 99-16100-147 .mis complement 

<220> 

<221> primer_bind 
<222> 173339. .173357 
<223> 99-5862-167. mis 

<220> 

<221> primer_bind 

<222> 173359. .173377 

<223> 99-5862-167 .mis complement 

<220> 

<221> primer_bind 
<222> 189938 . .189956 
<223> 99-5919-215. mis 

<220> 

<221> priraer_bind 

<222> 189958. .189976 

<223> 99-5919-215 .mis complement 

<220> 

<221> primer_bind 
<222> 197144 . .197162 
<223> 99-24658-410. mis 



19 



WO 01/40493 



PCT7IB00/01507 



<220> 

<221> primer_bind 

<222> 197164 . .197182 

<223> 99-24658-410. mis complement 

<220> 

<221> primer_bind 
<222> 200759. .200777 
<223> 8-303-235. mis 

<220> 

<221> primer_bind 

<222> 200779. .200797 

<223> 8-303-235. mis complement 

<220> 

<221> primer_bind 
<222> 202632. .202650 
<223> 8-300-221 .mis 

<220> 

<221> primer_bind 

<222> 202652. .202670 

<223> 8-300-221 .mis complement 

<220> 

<221> primer_bind 
<222> 202660. .202678 
<223> 8-300-193. mis 

<220> 

<221> primerjaind 

<222> 202680. .202698 

<223> 8-300-193. mis complement 

<220> 

<221> primer_bind 
<222> 203359. .203377 
<223> 8-299-128 .mis 

<220> 

<221> primer_bind 

<222> 203379, .203397 

<223> 8-299-128 .mis complement 

<220> 

<221> primerjaind 
<222> 204119. .204137 
<223> 8-296-213. mis 

<220> 

<221> primer_bind 

<222> 204139. .204157 

<223> 8-296-213. mis complement 

<220> 

<221> primer_bind 
<222> 204586. .204604 
<223> 8-252-190. mis 

<220> 
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<221> prime r_bind 

<222> 204606. .204624 

<223> 8-252-190. mis complement 

<220> 

<221> primer_bind 
<222> 204915. .204933 - ■ 
<223> 99-24644-194. mis 

<220> 

<221> primer_bind 

<222> 204935. .204953 

<223> 99-24 64 4-194 -mis complement 

<220> 

<221> primerjbind 
<222> 205187. .205205 
<223> 8-295-248. mis 

<220> 

<221> primer_bind 

<222> 205207. .205225 

<223> 8-295-248. mis complement 

<220> 

<221> primer_bind 
<222> 205310. .205328 
<223> 8-295-125. mis 

<220> 

<221> primer_bind 

<222> 205330. .205348 

<223> 8-295-125. mis complement 

<220> 

<221> primer_bind 
<222> 206045. .206063 
<223> 8-293-130. mis 

<220> 

<221> primer__bind 

<222> 206065. .206083 

<223> 8-293-130. mis complement 

<220> 

<221> primer_bind 
<222> 206526. .206544 
<223> 8-292-198. mis 

<220> 

<221> primer_bind 

<222> 206546. .206564 

<223> 8-292-198. mis complement 

<220> 

<221> primer__bind 
<222> 207294. .207312 
<223> 8-251-322. mis 

<220> 

<221> prime r_bind 
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<222> 207314 . .207332 

<223> 8-251-322 .mis complement 

<220> 

<221> primer_bind 
<222> 208266. .208284 
<223> 8-289-322. mis 

<220> 

<221> primer_bind 

<222> 208286. .208304 

<223> 8-289-322 .mis complement 

<220> 

<221> primer_bind 
<222> 208941. .208959 
<223> 8-287-249. mis 

<220> 

<221> primerjoind 

<222> 208961. .208979 

<223> 8-287-24 9. mis complement 

<220> 

<221> primer_bind 
<222> 209104 . .209122 
<223> 8-287-86. mis 

<220> 

<221> primerjaind 

<222> 209124 . .209142 

<223> 8-287-86. mis complement 

<220> 

<221> priraer_bind 
<222> 209612. .209630 
<223> 8-285-319. mis 

<220> 

<221> primer_bind 

<222> 209632. .209650 

<223> 8-285-319. mis complement 

<220> 

<221> primer_bind 
<222> 210342. .210360 
<223> 8-283-278. mis 

<220> 

<221> primer_bind 

<222> 210362. .210380 

<223> 8-283-278 .mis complement 

<220> 

<221> primer_bind 
<222> 210444 . .210462 
<223> 8-283-176. mis 

<220> 

<221> primer_bind 
<222> 210464. .210482 
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<223> 8-283-176. mis complement 
<220> 

<221> primer_bind 
<222> 210467 . .210485 
<223> 8-283-153. mis 

<220> 

<221> primer__bind 

<222> 210487. .210505 

<223> 8-283-153. mis complement 

<220> 

<221> primer_bind 
<222> 210564 . .210582 
<223> 8-283-56. mis 

<220> 

<221> primer_bind 
<222> 210584 . .210602 
.<223> 8-283-56. mis complement 

<220> 

<221> primer_bind 
<222> 210860. .210878 
<223> 8-282-345. mis 

<220> 

<221> primer_bind 

<222> 210880. .210898 

<223> 8-282-345. mis complement 

<220> 

<221> primer__bind 
<222> 210945. .210963 
<223> 8-282-260. mis 

<220> 

<221> primer_bind 
<222> 210960. .210978 
<223> 8-282-245. mis 

<220> 

<221> primer_bind 

<222> 210965- .210983 

<223> 8-282-260. mis complement 

<220> 

<221> primer_bind 

<222> 210980. .210998 

<223> 8-282-245 .mis complement 

<220> 

<221> prime rjbind 
<222> 211113. .211131 
<223> 8-282-92. mis 

<220> 

<221> primer_bind 

<222> 211133. .211151 

<223> 8-282-92. mis complement 
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<220> 

<221> primer_bind 
<222> 211228 . .211246 
<223> 8-281-367 .mis 

<220> 

<221> primerjaind 

<222> 211248 • .211266 

<223> 8-281-367 .mis complement 

<220> 

<221> primerjoind 
<222> 211296. .211314 
<223> 8-281-299. mis 

<220> 

<221> primer_bind 
<222> 211316. .211334 
.<223> 8-281-299. mis complement 

<220> 

<221> primerjoind 
<222> 211347 . .211365 
<223> 8-281-248 .mis 

<220> 

<221> prime rjoind 

<222> 211367 . .211385 

<223> 8-281-248 .mis complement 

<220> 

<221> primer _bind 
<222> 212501. .212519 
<223> 8-279-197. mis 

<220> 

<221> primerjbind 

<222> 212521. .212539 

<223> 8-279-197 .mis complement 

<220> 

<221> primerjaind 
<222> 212802. .212820 
<223> 8-278-289. mis 

<220> 

<221> primer_bind 

<222> 212822. .212840 

<223> 8-278-289. mis complement 

<220> 

<221> primer_bind 
<222> 213814. .213832 
<223> 8-277-250. mis 

<220> 

<221> primerjoind 

<222> 213834. .213852 

<223> 8-277-250. mis complement 
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<220> 

<221> primer_bind 
<222> 214838. .214856 
<223> 8-124-294 .mis 

<220> 

<221> primer_bind 

<222> 214858. .214876 

<223> 8-124-294. mis complement 

<220> 

<221> primer_bind 
<222> 215328 . .215346 
<223> 8-273-383. mis 

<220> 

<221> primer__bind 

<222> 21.5348. .215366 

<223> 8-273-383. mis complement 

<220> 

<221> primer_bind 
<222> 216948. .216966 
<223> 8~126-286*mis 

<220> 

<221> primer_bind 
<222> 216948. .216966 
<223> 8-130-143. mis 

<220> 

<221> primer_bind 

<222> 216968. .216986 

<223> 8-12 6-286 .mis complement 

<220> 

<221> prime r_bind 

<222> 216968. .216986 

<223> 8-130-143. mis complement 

<220> 

<221> primer_bind 
<222> 219521. .219539 
<223> 99-5897-143. mis 

<220> 

<221> primer_bind 

<222> 219541. .219559 

<223> 99-5897-143 .mis complement 

<220> 

<221> primer_bind 
<222> 220817. .220835 
<223> 99-24649-186. mis 

<220> 

<221> primer_bind 

<222> 220837. .220855 

<223> 99-24 649-186. mis complement 

<220> 



25 



WO 01/40493 



PCT/EBOO/01507 



<221> primerjoind 
<222> 223211- .223229 
<223> 8-247-337 .mis 

<220> 

<221> primer__bind 

<222> 223231. .223249 

<223> 8-247-337 .mis complement 

<220> 

<221> primer__bind 
<222> 224194. .224212 
<223> 8-246-393. mis 

<220> 

<221> primer__bind 

<222> 224214. .224232 

<223> 8-24 6-393 .mis complement 

<220> 

<221> primerjblnd 
<222> 224773. .224791 
<223>, 8-245-270. mis 

<220> 

<221> primer_bind 

<222> 224793. .224811 

<223> 8-245-270. mis complement 

<220> 

<221> primer__bind 
<222> 230376. .230394 
<223> 8-119-247 /mis 

<220> 

<221> primer_bind 

<222> 230396. .230414 

<223> 8-119-247 .mis complement 

<220> 

<221> primer_bind 
<222> 231658. .231676 
<223> 99-15870-400. mis 

<220> 

<221> primerjoind 

<222> 231678. ,231696 

<223> 99-15870-400 .mis complement 

<220> 

<221> primerjoind 
<222> 235296. .235314 
<223> 99-16087-219. mis 

<220> 

<221> primerjoind 

<222> 235316. .235334 

<223> 99-16087-219 .mis complement 

<220> 

<221> primer_bind 
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<222> 238770. .238788 
<223> 8-155-258. mis 

<220> 

<221> primer_bind 

<222> 238790. .238808 

<223> 8-155-258 .mis complement 

<220> 

<221> primer_bind 
<222> 239744. .239762 
<223> 8-136-267 .mis 

<220> 

<221> prirtier_bind 

<222> 239744 . .239762 

<223> 99-16038-118. mis 

<220> 

<221> primer_bind 

<222> 239764. .239782 

<223> 8-136-267 .mis complement 

<220> 

<221> primer_bind 

<222> 239764 . .239782 

<223> 99-16038-118 .mis complement 

<220> 

<221> primer__bind 
<222> 241842. .241860 
<223> 99-15880-162. mis 

<220> 

<221> prime rjoind 

<222> 241862 . ,241880 

<223> 99-15880-162. mis complement 

<220> 

<221> primer_bind 
<222> 243681. .243699 
<223> 8-271-227. mis 

<220> 

<221> prime r_bind 

<222> 243701. .243719 

<223> 8-271-227. mis complement 

<220> 

<221> primerjbind 
<222> 245644. ,245662 
<223> 8-266-131. mis 

<220> 

<221> primer_bind 

<222> 245664. .245682 

<223> 8-266-131. mis complement 

<220> 

<221> primerjoind 
<222> 245947. .245965 
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<223> 8-265-208 .mis 
<220> 

<221> primer_bind 
<222> 245952. .245970 
<223> 8-265-203. mis 

<220> 

<221> primerjDind 

<222> 245967. .245985 

<223> 8-265-208 .mis complement 

<220> 

<221> primer_bind 

<222> 245972. .245990 

<223> 8-265-203 .mis complement 

<220> 

<221> prime r — bind 
<222> 247845. .247863 
<223> 99-2594 0-182. mis 

<220> 

<221> primer_bind 

<222> 247865. .247883 

<223> 99-25940-182. mis complement 

<220> 

<221> primer_bind 
<222> 248296. .248314 
<223> 99-16032-292. mis 

<220> 

<221> primer_bind 

<222> 248316. .248334 

<223> 99-1 6032-292. mis complement 

<220> 

<221> prime r_bind 

<222> 255829. .255847 

<223> 99-16105-152. mis 

<220> 

<221> prime r_bind 

<222> 255849. .255867 

<223> 99-16105-152. mis complement 

<220> 

<221> primer_bind 
<222> 260080. .260098 
<223> 99-16033-244 .mis 

<220> 

<221> primer_bind 

<222> 260100. .260118 

<223> 99-16033-244 .mis complement 

<220> 

<221> primer_bind 
<222> 279770. .279788 
<223> 99-15875-165. mis 
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<220> 

<221> prime r__b in d 

<222> 279790. .279808 

<223> 99-15875-165. mis" complement 

<220> 

<221> primer_bind 
<222> 300346. .300364 
<223> 99-16052-214 .mis 

<220> 

<221> primer_bind 

<222> 300366. .300384 

<223> 99-16052-214 .mis complement 

<220> 

<221> prime r_bind 
<222> 312011. .312029 
<223> 99-16047-115. mis 

<220> 

<221> primerjaind 

<222> 312031. .312049 

<223> 99-16047-115. mis complement 

<220> 

<221> primer_bind 
<222> 315995. .316013 
<223> 99-25993-367. mis 

<220> 

<221> primer_bind 

<222> 316015. .316033 

<223> 99-25993-367 .mis complement 

<220> 

<221> misc_binding 

<222> 8304. .8328 

<223> 99-27943-150. probe 

<220> 

<221> raiscjoinding 
<222> 21660. .21684 
<223> 99-27935-193. probe 

<220> 

<221> misc__binding 
<222> 65473.-65497 
<223> 8-128-33. probe 

<220> 

<221> misc_binding 
<222> 95384. .95408 
<223> 99-31960-363. probe 

<220> 

<221> raisc_binding 
<222> 107269. .107293 
<223> 99-24656-260. probe 
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<220> 

<221> miscjbinding 
<222> 160628. .160652 
<223> 99-24639-163. probe 

<220> 

<221> misc__binding 
<222> 160864 . .160888 
<223> 99-24634-108 .probe 

<220> 

<221> miscjbinding 
<222> 168962. .168986 
<223> 99-7652-162. probe 

<220> 

<221> misc_binding 
<222> 170798 . .170822 
<223> 99-16100-147. probe 

<220> 

<221> misc__binding 
<222> 173346. ,173370 
<223> 99-5862-167. probe 

<220> 

<221> miscjsinding 
<222> 189945, .189969 
<223> 99-5919-215. probe 

<220> 

<221> miscjbinding 
<222> 197151. .197175 
<223> 99-24658-410. probe 

<220> 

<221> miscjbinding 
<222> 200766. .200790 
<223> 8-303-235. probe 

<220> 

<221> misc binding 
<222> 202639. .202663 
<223> 8-300-221. probe 

<220> 

<221> miscjbinding 
<222> 202667. .202691 
<223> 8-300-193. probe 

<220> 

<221> misc_binding 
<222> 203366. .203390 
<223> 8-299-128 .probe 

<220> 

<221> misc_ binding 
<222> 204126. .204150 
<223> 8-296-213. probe 

<220> 
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<221> misc — binding 
<222> 204593. .204617 
<223> 8-252-190. probe 

<220> 

<221> misc_binding 
<222> 204922. .204946" 
<223> 99-24644-194. probe 

<220> 

<221> misc_binding 
<222> 205194. .205218 
<223> 8-295-248. probe 

<220> 

<221> miscjoinding 
<222> 205317. .205341 
<223> 8-295-125. probe 

<220> 

<221> miscjoinding 
<222> 206052. .206076 
<223> 8-293-130. probe 

<220> 

<221> misc_binding 
<222> 206533. .206557 
<223> 8-292-198. probe 

<220> 

<221> mis*c_binding 
<222> 207301. ,207325 
<223> 8-251-322. probe 

<220> 

<221> misc_binding 
<222> 208273. .208297 
<223> 8-289-322. probe 

<220> 

<221> misc_ binding 

<222> 208948. .208972 

<223> 8-287-249. probe 

<220> 

<221> misc_binding 
<222> 209111. ,209135 
<223> 8-287-86. probe 

<220> 

<221> misc_binding 
<222> 209619. .209643 
<223> 8-285-319. probe 

<220> 

<221> misc__binding 
<222> 210349. .210373 
<223> 8-283-278, probe 

<220> 

<221> mi sc_ binding 
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<222> 210451. .210475 
<223> 8-283-176. probe 

<220> 

<221> mi sc_bi riding 
<222> 210474. .210498 
<223> 8-283-153. probe 

<220> 

<221> misc_binding 
<222> 210571. .210595 
<223> 8-283-56. probe 

<220> 

<2 21> misc__binding 
<222> 210867. .210891 
<223> 8-282-345, probe 

<220> 

<221> misc_binding 
<222> 210952. .210976 
<223> 8-282-260. probe 

<220> 

<221> miscjainding 
<222> 210967. .210991 
<223> 8-282-245. probe 

<220> 

<221> misc binding 
<222> 211120. .211144 
<223> 8-282-92 .probe 

<220> 

<221> misc_binding 
<222> 211235. .211259 
<223> 8-281-367. probe 

<220> 

<221> miscjbinding 
<222> 211303. .211327 
<223> 8-281-299. probe 

<220> 

<221> misc_binding 
<222> 211354. .211378 
<223> 8-281-248. probe 

<220> 

<221> misc_binding 
<222> 212508. .212532 
<223> 8-279-197, probe 

<220> 

<221> miscjbinding 

<222> 212809. .212833 

<223> 8-278-289. probe 

<220> 

<221> misc_binding 
<222> 213821. .213845 
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<223> 8-277-250. probe 
<220> 

<221> misc_binding 
<222> 214845. .214869 
<223> 8-124-294. probe 

<220> 

<221> misc_binding 
<222> 215335. .215359 
<223> 8-273-383. probe 

<220> 

<221> misc_binding 
<222> 216955. .216979 
<223> 8-126-286. probe 

<220> 

<221> misc_binding 
<222> 216955. .216979 
<223> 8-130-143. probe 

<220> 

<221> misc_binding 
<222> 219528. .219552 
<223> 99-5897-143. probe 

<220> 

<221> miscjoinding 
<222> 220824. .220848 
<223> 99-24649-186. probe 

<220> 

<221> mi sc__bi riding 
<222> 223218. .223242 
<223> 8-247-337. probe 

<220> 

<221> misc_binding 
<222> 224201. .224225 
<223> 8-246-393. probe 

<220> 

<22i> misc_binding 
<222> 224780. .224804 
<223> 8-245-270. probe 

<220> 

<221> misc_binding 
<222> 230383. .230407 
<223> 8-119-247 .probe 

<220> 

<221> misc_binding 
<222> 231665. .231689 
<223> 99-15870-400 .probe 

<220> 

<221> misc_binding 
<222> 235303. .235327 
<223> 99-16087-219. probe 
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<220> 

<221> misc_binding 
<222> 238777. .238801 
<223> 8-155-258 .probe 

<220> 

<221> misc_binding 
<222> 239751. .239775 
<223> 8-136-267 .probe 

<220> 

<221> misc_binding 
<222> 239751. .239775 
<223> 99-16038-118. probe 

<220> 

<221> itiisc_binding 
<222> 241849- .241873 
<223> 99-15880-162. probe 

<220> 

<221> miscjbinding • 
<222> 243688. .243712 
<223> 8-271-227 .probe 

<220> 

<221> miscjoinding 
<222> 245651. .245675 
<223> 8-266-131 .probe 

<220> 

<221> misc_binding 
<222> 245954 . .245978 
<223> 8-265-208 .probe 

<220> 

<221> misc_binding 
<222> 245959. .245983 
<223> 8-265-203. probe 

<220> 

<221> misc_binding 
<222> 247852. .247876 
<223> 99-25940-182. probe 

<220> 

<221> misc_binding 
<222> 248303. .248327 
<223> 99-16032-292. probe 

<220> 

<221> miscjbinding 
<222> 255836.. 255860 
<223> 99-16105-152. probe 

<220> 

<221> misc_binding 
<222> 260087. .260111 
<223> 99-16033-244. probe 
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<220> 

<221> mi sc_bi riding 
<222> 279777- .279801 
<223> 99-15875-165. probe 

<220> 

<221> misc_binding 
<222> 300353. .300377 
<223> 99-16052-214 .probe 

<220> 

<221> misc__binding 
<222> 312018 . .312042 
<223> 99-16047-115. probe 

<220> 

<221> misc_binding 
<222> 316002. ,316026 
<223> 99-25393-367. probe 

<220> 

<221> miscjfeature 

<222> 4222,11245,27205. .27206,27212. .27213, 27240 , 27.347 , 27378 , 28258 

108818,128283,137062. .137063,164757,172148. .172150,18 0820, 184 929 
1944 60,200104,207061. -207062,20794 9,239085,249787,249806,250156 
25018 9,2508 41,251114,264938,265019,26507 8,274 640,277770,314 308 
318680,318731 

<223> n=a, g, c or t 

<400> 1 

cgggcgaatt cgagctcggt acccggggat catctttgtc atagattgta gtaaaaataa 
tctgtgaaaa gtgactgatt attgacaggt agccttgggt atgcagctca agtaggcatt 
ataaaataaa aaacatcaca ttccatgtat tttcaaaaaa agcttctggc ttactcaaaa 
ggtaagtgtt ctttaaacaa cttcaatgga aaaggagttt cttccaaatg cttggagaac 
tccttctcaa gtccggaata aagcaaggtc tacagagaca aagtataaat aaaaccatgg 
tgctgagctg gtaccactat tgggctgcta agcactgcta agcattgcct tcagatgagt 
tacctagcct ttgtaaagct cagattccac agctataaac tgagatgaat aataatacac 
aaaacactaa gctgttgagt agattaaatg aaattattta tgtagagcaa atagaacagt 
actgtacctt ataaaattta aaaccaaaat gtaaattctg atttttaccc aagggtcaac 
tccgcacaat tttgcaatgg acccttcccc ggccctccag aagctgaata tcgttgtgta 
tcatatccaa attcagtatc accagggcat agagtaagtg ccaactgtga ggccatagct 
atgcgtccac tcccctgaaa cacattgtgt accttcattc ttggagttga gtctttgcat 
ttcatcctcc agagtgagaa tagactgcgt gttggagagg tcagggaaat tatgtctttc 
tatgtctagc cctttatagg tcgattattt gaatgcttga cagcatctgt ccacttatct 
atccctgcaa taataaaata cgggactaaa gtagctcgta tctctagtgg gagagtgcat 
atattttata cgtcaattct ttaaaagggg aaattctttc agagcttcag aaaatatcac 
ggaagagtta aaagaatgtt tgagaagaac atgtctgagt attacctact agcattacaa 
ttttactgct ttaaagtaat tataattttc attgcaggat ctattaaaaa ggcttttacc 
tgacggtaat tcttcttgta cttacaggtc attattacaa tgaagggaag aaaggaagac 
atgaagggga aatgggacta tttaccaagt gtatctgcat agactaaagg agacaagagg 
aagtactgag caccttgatt cctgcagata attatttaag aaccataaga attctttcct 
agcttctcat aagcctgctc cttcaaaggg tgagtagttg acatgatttt acatcccatt 
tatatttctt tattagataa ttttatttca aatctagtct tattttttct gaaccttctg 
ggatatataa cattttatat tatcatttat gtcatcaaga atgtagattt atattctttt 
ctattcacat gtttatgtaa ataattgaac acctctaaat gacagatact atcattttat 
atattgttaa taagtatctt catggctggg cgtggtggct cacgcccgta atcccagcac 
tttgggaggc cgaggtggga ggatcacctg aggtcaggag tttgagatca gcctggcgaa 
cgtggtgaaa tcccatctct actaaaaaca acaaaaatta gctgggcatg gtagcatgcg 
cctatagtcc cagctactag ggaggccaag gcaggagaat cgatcgctca aacctgggag 
gcagaggttt cagtgagctg atatcgtgcc actgcactct ggcctgggca acagaatgag 
accctgcttc aataaataaa taaattaatt aattaattaa taaagtatct taatgacaaa 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
900 
960 
1020 
1080 
1140 
1200 
1260 
1320 
1380 
1440 
1500 
1560 
1620 
1680 
1740 
1800 
1860 
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tatttaaata tatataaata tttacattta catatgggaa gagagatctg aaattttaag 1920 

caacttttat ccaaaaacct atttttttca aagttcctcc aacatctata aaagaagtaa 1980 

agtcacttat agatctatct tacttcaggg cctatgtttt cctctaccat acaacactgc 2040 

acattcacag acacagaaat cttttacatt tctaaaataa ataaaacatc caaacttcag 2100 

tttaaaaaaa atggatctac agatccacca tagactaaac gaccaaacat agtcagaggt 2160 

gtttacttgt ttgtttcggt cttaagggaa tcttcaaact accctagagc ctcgtgtatc 2220 

caatgagagc agtctggaca" ttgaatgaat ggtgatgtca ttaaatgagc tatggagcaa 2280 

gatgtaagaa cacgctttag agtaaataac atatttggct tctgaggtat aaaaagggaa 2340 

ggaggctttc tcaggacatt tgggtacctg gtccttcata ctcaggaggt ttttccatct 24 00 

tcatgtcatg gaataaacta tgctttaaag aaatgctgcg tcagcagaaa gtgtttcaaa 24 60 

tataaggaaa ttgtcatagc tttgaaatta tgtcaaatat cattttaact aagagagcca 2520 

ttgtctttcc atatatttat ctagaagaaa atttactgga caaaataaat atagaacaac 2580 

ttcatgtgta gtcattaatg gtcggatagg cagttactat aaaatatgaa taatctaaaa 2640 

atcattttag ccatttgttt ttaataaatc tcacttgtga ttgatttatt aaaaacaggg 2700 

cctccagaag aaagaaactt aatacctttc ctccatcatt atgaaggcac tagtactgac 2760 

gtaactaaaa ttagatggca gcttgaggga ggaagagatg atgtagagat ggtgtgtaat 2820 

ctgtgtaaac agtctctcaa ctgtctagag ttagctgtac gaatctgttg ttggctccag 2880 

ttataagaac caagatatgt atgtaaggac aaacatcctt aatttgctta gcaagtcgta 2940 

tgtatttgat agaccaccaa ttttatttta ttttatttta tttttgagat ggagtcttgc 3000 

tctgtcacct gggctggagt gcagtggcgc gatctcggct cactgcaaac tccatttagc 3060 

aggttcaagt gattcttctg cctcagcctc ccgagtagct gggattacaa gtgctcaaaa 3120 

ccacccccgg ttaatgtttt ' tttcttttct tttcttttct tttcttttct tttcttttct 3180 

tttcttttct tttcttttct ttttttagta gtagacacag ggtttcacta tgttggccag 3240 

gctggtctca aactcctgac ctcgtgatct gcccacctcg gcctcccaaa gtgctgggat 3300 

tacgggatta caagcgtgag ccactgcacc agccctgatt tttaaaaggg attttagaat 3360 

ttcagttgaa catgtgtttt tgaagtaact ccatattgaa caggggaatc agagtaagta 34 20 

aaccactaca ggaaaatcag gaaggcagac tccaatttat gttttctgtg tataacctac 34 80 

tttaataaag aattaaaaat cataagagta gttttatcta gtggcctaga acttgattat 3540 

gaaaagtaag acaaaggaaa tctgacctca tgccagtgtc tggaagcctt acaggaactg 3600 

aaaggaccca cgtggaacag ctggcaactg aaactcatct tgtcaatttc aagtgtagtc 3660 

cctcttccca .attcaccgga gaaaacacag ccagctttgc atacagtgtc tggtgtcacc 3720 

agagaaaaca cagtcagctt tgcatacaat atctggtgtc agaatgacca aacgattggc 37 80 

ttctatcctt caggattgtg aacagagagc caaggcatac aaccagtatg cccactccgt 3840 

cagctgcgag acagaggaga taaagaatgc tgaacattat gggtcactgc tttgaagtat 3900 

tcagaggcca aaatagttca aaataattgt gttaatcatt aaaagtagaa gaagtggaag 3960 

ccaaaattat gcctgcctat atttcagttt tgaatgtgtg gagatcagaa atttaaaaaa 4020 

aagtttatga gtttcccctt ttaaactttg agaaaagacc ataagaaccg gtgttttagt 4080 

aaatcatctt caagagtaat ttactcataa aaagaaaaga actcatgata ccgtgtaaac 4140 

aaaaggctaa taaatgctgg tgtaaataag attaagggag gacaaagatg aaggaaagca 4200 

ggacatatca aaaaacaaat anaataagaa attttcagag cttgaatata caaacaggca 4260 

catcaaaaga ggttcaatgg aaacctaatg caatgaatgg gacaattaat gcttttaaac 4320 

atcacttgga accacagaaa ataccaacag aaaaatctgg tcacgtacaa gagataagag 4 380 

tcagggttcc atggatgttc ttcttagatg agaaggacac ttcactttta tggcattgtt 44 40 

cccacaaatc cataacccaa gtttcatcat gagcaagcct cagccaaaga aaaattgaga 4500 

tgtattttac aacatacccg ttcaggccct ttaaaaatgt caaagttggc caggcaccat 4 560 

ggcttccacc tgtaatccca gcactttggg aggctgagat gggtggatca tttgaggtca 4 620 

ggagttcgag accagcctgg acaatatggg gaaaccccaa ctgtactaaa aaaagataac 4680 

aaaattagcc aggtgtggtg gtgtgggcct gtaatcccag ctacttgaga ggctgaggca 474 0 

ggagaatcgc ttgaacccag gagccaggag gtggaggttg cagtgagcca agatcatgcc 4800 

actgtattcc aacctgggcg acagagcaaa actccatctc cccccaaaaa aagcaaagtc 4 860 

gtgaaaaaca aagtaagact aaggaactgt cagattagag gggactgagg aaacatgaaa 4920 

attagatgaa atagggtata cagtttagaa ttatagaaca gaaagcagga catcagttga 4980 

acaactaaag aaatacaaat aaagtctatt ttagttaata ttgttgtacc aatactaatt 504 0 

tcttagttgt aacaaataaa atatcattat gtaagatggc aacttcagga aaagttgggg 5100 

gaaagatata caggaatact ttgtattatc ttagaaacta ttatgtcaat attttaatat 5160 

aaaattatta gaaattgtgt cagaaacact ggatgttaga agacaatgga ggaatgcctc 5220 

aaaaaatcta cagagaactt cttttcaatc tatcatcata gcatcaatca agtatgagct 5280 

tcaaattatg atattttata atatgagaat aattctgaaa atttacctta taccaatatt 5340 

ttcttagaag attactttag gatgtgctat agtcaaacag aggaataaac taagaaaaag 54 00 

tatgatgtga gatacagaaa aatatataga aacaatttag aaaaataata aaaagtagtc 54 60 

ctatgataat agcaatataa gaatgtttag attggaatag aagatgaatg ccctcaagag 5520 
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tataacctct gaaagaaaaa ttaaataaat aaaaaaattc tgacaggata cctgatcaga 5580 

tagagatgtg acaaaccagt atgttgaaag gtgatgacag aaagagagaa cggaaagaga 5640 

gaggcaggaa ggaaagggga aaaaaagaaa gaaatgaaag gcgattctca caccagttag 5700 

aatggcaatt attaaaaagt caggaaacaa caggtgctag acaggatgtg gagaaatagg 5760 

aacactttta cactattggt gggactgtaa actagttcaa ccattgtgga agtcagtgtg 5820 

gcgattcctc agggatctag aactagaaat accatttgac ccagtcatcc cattactggc 5880 

tatataccca aaggattata" aatcatgctt ctataaagac acatgcacac gtatgtttat 5940 

tgcggcacta ttcacaatag caaagacttg gaaccaaccc aaatgtccaa caatgataga 6000 

ctgggttaag aaaatgtggc acatatacac catggaatac tatgcagcca taaaaaatga 6060 

tgagttcatg tcctttgtag ggacatggat gaagctggaa accatcattc tcagcaaact 6120 

atagcaagga cagaaaacca aacaccacat gttctcactc ataggtggga attgaataat 6180 

gggaacacct ggacacagga aggggaacat cacacaccag ggcctgttgt ggggtggggg 624 0 

g" a 99ggg<? a< 3 ggatagcatt aggagatata cctaatgcta aatgacgagt taatgggtgc 6300 

agcacaccag catggcacat gtatacatat gtaactaacc tgtacgttgt gcccatgtac 6360 

cctagaactt aaagtataat atatatatta tatatatata aagaaattcc aagaaaatca 6420 

aaaggctgca tacaaatgaa atactatatg accattattt ccttcttggc tttttcaaga 6480 

acaacattta cttagctacc acactgaaat tactattcat taattttaag taaactggca 6540 

ttgaaaatat ggttaatctg gtttttcttt ttttttttga gacagggtct cgctctgtta 6600 

cccaggctgg agtgcagtgc tgtgatctta gctcactgca acctctgcct cccaggttcc 6660 

aacaatcctc ccacctcagc ttcccccagt agctgggatt acagctgtgt gccaccatga 6720 

acagctaatt ttatattttt agtagagatg gggttcgccc tgttggccag gctggcctcg 6780 

aactcctgac ttcaagtgat ctgcctgcct cagcctcccc aagtgttggg attacaggtg 6840 

tgagccaccg tgcccagccc agaaatgtgt ttagtagttt gaaggggcaa gtaccaggtg 6900 

ctaagagttc acaggggagg gatactggac ttagaccttc ggaatcaaga aaagctttct 6960 

atagaaagta ctgcctaagc tgaagactga acactgacca ggctgaatga aagatgtagt 7020 

gctggagagg atttcagtca gaaaaaacat ttgagaaggt ccagaagaaa aataaaacaa 7080 

tttatttcaa ggtactgaaa taaattgaga ttaaggggag tgaagagtca aggtctggag 7140 

aggtattgag ttgtcaagaa atacagtttg agaggtaaag gccatattag gaagtaatat 7200 

atttttaaac ttttgaagct tattctaagg ctaataacaa agtggtgtag gcaggaaagt 7260 

tgcagttatt ataaaacttt cagaaagatt tatttaaatt aattggaatt. gagaaagaaa 7320 

ggcattattt tacgtaataa aaatatttat ttgttgtgta aaaaataaat agtttggcct 7380 

agaggagaac agggccagta aggaagagga gaaatgcata gatataaagg gtgttaggat 7440 

aaagcataag taagacttaa tgataattta aacttaagga acaaggaaaa tgaaggagta 7500 

atgaatactt gagtcacttt gtagatagag gcaccaatca ctcaactcag acagagtaga 7560 

ctagaggtgg ttttggggaa gctaaagact ccacttttag atctctggaa tatccaagat 7620 

gggaattcaa ttaggctatg atttcaagtt caaacaagag aaacgggatg gagatatttt 7 680 

tagataccat ctgataatga ttagtgattg aagccatggg aatggatatt ttgtcaatgt 7740 

tcttttgaga gctgtatcct tccgtagtta tctgatgggt tttttaaatg gcgtaaaata 7800 

atcagtaatt tagaagtttc tataaagaaa gttcggctac aaatacttat cctttctctt 7860 

tcattctcaa accatgcatt aac'cacctat taatgtgttt agaaaaacag taagtaattc 7 920 

agggttcatg ctgacagaga agacccacat aagcaaacct cgctataata caaggagcta 7 980 

aattccccaa tataagaatg cagtgtgctc tgtgggaata tggagaaaca aagataagga 8040 

aaacctaggc agaaatagtt caattgattg ttatttcata ctgagacaaa tttgccaagt 8100 

ataaactata tcctattaca aatttagatg ctttgggcca tgggtcagct ccacggcgac 8160 

acctacactg aagttgtatt taagctatct gtctggatca cttggttcac aaaaatgttt 8220 

ggattacaaa aaggaaatat catacacctt ggaagaactg gctcattagt caagatgctg 8280 

cttaccagtt gcctgggttg gctgctgtgg taaagsaata tcggttggaa agccggggct 834 0 

ggaagtcggg gagaggctgt cttctatatc ttacatttgc tatgtctttg gaggaaagat 8400 

gctttcggca gtgttcctta aatttaaata tcttttcctt ttggactcaa gggtatgttt 84 60 

attcgacttc atcatatcta atctcctaga atattattaa cctgaagtta ttgtcagtat 8520 

cgtatgatat ttattgaata acaatatgct ctgatactgc atatgaaatg aaataacaaa 8580 

tgaacctccc ttactttaac ttaactacta attaattaga aaagaacctt cactcaaggg 8640 

gtttggaaag gtccaataag ggaattaaca gatgagaatt. taagagtacc tgaacccatc 8700 

tcgatttgag atttacgtga tgctgggtga ttagtgataa gaacagaatg aagtcagctc 87 60 

aggaggcatc gtgatctgga aaggagggag atatttcagt gaataattga gaggattcag 8820 

acaaagtcat atgagagaag cagagacaga cagagggcag gagtcagtgg atatgagaat 8880 

gatactagga agatgaagaa tggactgatt tgtaaagaaa ttgagaccca agttatactg 8 94 0 

tgttaagaga taagttaaaa ccattaaaaa atattgggag atcctgaatt gctttcaagc 9000 

ctagtaattt gatgtaagaa atagtgagaa atgctcttga tgttgtgtct gtttcaggga 9060 

aagttttgtg gagacaatga aataaataaa atagcatatt tagatcaaga ggctggtttt 9120 

ctttttcaca ggaaacatct aagaaactgt taaaagggac aaggtttgag tgttagaacg 9180 
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tgaaataggg 

ccaaaaaaag 

tgtttatttc 

ggcttgattt 

aagtttggga 

aaaataatgt 

gaataaagag 

aatgagatta 

gtgatgatgg 

aagataatga 

gactaccatg 

acaggtgcac 

tgggaatagg 

gggaattcaa 

tcacagttca 

cagagaccat 

tctagggcag 

gcagcaggca . 

attttaggcc 

aaatgagata 

aaatttaaac 

gcaacagcat 

ttcccttgcc. 

tcctggtggg 

gcctggaacc 

catgtatgat 

ataataaaat 

cttfctttca 

ccatgcctat 

ggtggcctac 

ggatgattca 

gagcagtgca 

ttgaaaaggc 

aggactactg 

aaaaacaaat 

aaattcacaa 

atggtgtaga 

gcaatttaca 

acaatcatga 

agagagagaa 

tagtcactac 

caggttcctt 

ggggacacag 

aaaattactc 

tacacctaat 

attgccaaaa 

cccctctctc 

gggactcagc 

gtctcagagg 

ttgcctgtcc 

aaaacccttc 

atcataggtt 

tcagctttac 

gggagaagaa 

actttgattc 

ccactgggaa 

atttatcata 

aaccccaaag 

agaattatat 

attatatact 

ttacatgttt 



ctatggtaat 
agtttttaaa 
agattcaacc 
atgtaagcaa 
aaaggctggt 
agatgtggaa 
aaatgggtca" 
tgaaaaaaag 
catacctcag 
atccaatttt 
tagacagata 
tcattgagtg 
aagactcaat 
gaagatcttc 
gccatggccc 
ttgccctcta 
ttgaaggaga 
agtttgcact 
agaagtcttt 
gttggcattt 
agcaatattt 
attaaagctt 
ctggtgggag 
atatactccc 
atgaatggta 
aaagtttaat 
agaataatta 
aaatatctta 
gtgatgtgat 
tattgaccta 
tgtcaaggga 
caatttaaaa- 
agtgtgccac 
tattcttaga. 
tagatcaaca 
ctttcaaaga 
tatatattcg 
aaagaaagag 
•tggaaggcaa 
cttgtgcagg 
cataagaaaa 
ccacaacgtg 
ccaaaccata 
ggtttattac 
gccaaagttc 
agttacaggg 
tttccttgct 
ttccatgatg 
agaagaggga 
attgctcctg 
agaaaagagt 
caacccaaat 
gaattttatt 
taaaagaact 
ttcctttata 
gtcagagttt 
cttgtattca 
cattagagac 
atataaataa 
atatatggtt 
ataatttata 



aaattgttac 
caaacatatg 
attttaacca 
agatttggat 
gggctttgtg 
tcctgtaaca 
acgtggatca 
aggatgagaa 
atagaaatga 
gaacacattg 
accaaataga 
aaccagtaca 
gaaggcaata 
tttggcttcc 
agaatggtga 
ggagcaactg 
gagtaaattg 
tttgaggaga 
gacatgct ag 
ttggcatttt 
agcacaaaaa 
tttcttcttt 
tatgtcctcg 
tcttatccta 
ccaagcccta 
ttataaatta 
taacaatata 
ttgtactgta 
gtgatgaagt 
ataagcaggt 
agaattgaat 
cttatgaatt 
aggtaattga 
gcaattgtat 
tgggntggta 
aaaatattgt 
tctgttttca 
gtttaagaga 
gaatgagcaa 
gaaactcctg 
gcacaggaaa 
tgggaattgt 
tcaagatatc 
tgacattctg 
ttacagttca 
agagcctcag 
cttctctctc 
ctgcctcaga 
ggccatctgg 
cttcctgatc 
aacagagaaa 
cttgtttcag 
catgcttcag 
actccgaaaa 
aatacaaagg 
atctatatcc 
gacatgatgc 
agatagatag 
aatttcttta 
gtataaaatt 
attatattat 



cattccgaag 
aagatataca 
caaagaacaa 
ttgaggcagt 
agacaaagtg 
atggcaatta 
tttttagaag 
attatgtttc 
attattttaa 
aattcaaagt 
atagaaacac 
gataaactat 
aggggaagtt 
ataagatcct 
tgaccgaggg 
atggagagaa 
taaaccaaga 
aattctagga 
agagaaagat 
tggcaaagaa 
atgcttgtca 
cctgcaatag 
cttatcctgg 
gtgggacatg 
cacacactat 
ggcatagtta 
ctgtaataaa 
ctcacccttt 
aaggtgaatg 
tgtgtctgca 
gggacagcac 
gtttatttat 
aaccacagaa 
atcattggaa 
ccattaggaa 
gagtctctgc 
tggtgctaac 
cgcacagttc 
atcacgtctt 
tttgttaaaa 
gatccacccc 
gggagttaca 
agcagaaatg 
atgtgcattc 
aactctagga 
tgtttgtttg 
catcattctt 
ctttctatcc 
agaaggtaca 
cctgccaggg 
actcaccaaa 
gttacagata 
aaagttgcta 
tctgaaaaga 
atttatggtc 
atcttcacat 
tattattatt 
atagatagac 
tataatgaag 
atatattata 
atataattat 



attatcaaat 
atataataaa 
aacaacctca 
gaatatagga 
accatgccat 
gtgatgaagt 
gaaaagcact 
caaggtttat 
aggaaaccag 
gacaatccgt 
ccaccatata 
attgtagggg 
aaaaagaata 
agactgtttc 
cagggccatg 
taaattgggg 
atgccatagc 
agggatttct 
ggattttttt 
agatttcatt 
aattcctgct 
ctgatgttca 
tgggaatatg 
ttccaagact 
gttttcatca 
gagataaaca 
agttgtctga 
tttgatgatg 
agaaaggcat 
gcatggagac 
agattttatc 
agaattttcc 
agtgaaactg 
tattattgat 
agatggtaga 
tatgagatag 
aaagacacac 
cacgtggctg 
acacggttgg 
ccatcagatc 
atgattcagt 
gttcaagatg 
agcaaaaaat 
cttaagagat 
gcaagagctg 
gactctccag 
cccagccttc 
atacggtatt 
gggcacctca 
gcacacctgg 
tacaagccat 
gagaattcca 
aacagtgtaa 
gttaaaataa 
ttttgcatca 
ttattacaac 
ttccactggc 
agagatattt 
aaattcatat 
cattatatat 
ataatttata 



tatctcccag 
taaactagta 
actgagagca 
ggcattggaa 
gagaattgtg 
ggtgacatgg 
gaaactttgg 
tgcaaaagct 
tttttagaga 
catccatgca 
taataaaacc 
ctctcacatc 
agataaactg 
taggaagcag 
gtttggtgct 
agagaataaa 
taaaagggaa 
ccacatttcc 
tttaatgaat 
ttcaagacca 
gaatttgaaa 
cgatacagta 
ttccccttta 
cccagtggat 
tatacataca 
ataatagcta 
tgtggtctct 
tgagaagata 
catgacgcag 
tctggaaaag 
acattactca 
atttaatatt 
cagatgaggg 
agtttaatga 
ttttcaattc 
atattgagaa 
ccgagactgg 
gggaggcctc 
cagcgggcaa 
tgatgagaca 
tacctcttac 
agatttgggt 
gagtgatttt 
aattacattt 
cccaagaacc 
ctcctacctt 
ccaggttctg 
attgcactgt 
gcagtgtctc 
agaaacttag 
tcaatgtgcc 
gtgcttctta 
atgaatattt 
attccagtgc 
gccagggtgg 
cacacacagc 
atggctttta 
atacaataaa 
aaaggaatat 
tatatgtaat 
aaattatata 



9240 
9300 
9360 
9420 
9480 
9540 
9600 
9660 
9720 
9780 
9840 
9900 
9960 
10020 
10080 
10140 
10200 
10260 
10320 
10380 
10440 
10500 
105 60 
10620 
10680 
10740 
10800 
10860 
10920 
10980 
■ 11040 
11100 
11160 
11220 
11280 
11340 
11400 
11460 
11520 
11580 
11640 
11700 
11760 
11820 
11880 
11940 
12000 
12060 
12120 
12180 
1224 0 
12300 
12360 
12420 
12480 
12540 
12600 
12660 
12720 
12780 
12840 
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taaaaattgt gtatattata aaatatg.taa tatataattt ataatatatt ataatttata 12900 

tataatacat aatatatatt ataatttata tataatacat aatatataat atattataat 12960 

ttatatataa tatataatgt attatatata tgatgtattg tatatataca tacatatttc 13020 

caccaatgtg attttaaaag actaccataa actcactgaa aagaaaacat aacagttggt 13080 

ttccaagaca tgaggacatc agttggaata acgtttcttc catagaatta aatgcatctc 1314 0 

tggtgtcttg gaaaggcaat ttcctctcaa atgggtaaag tgtttgtact aatactgtat 13200 

aaaaatgaat acaataaaat" aggcgcaaag aatctcacaa cttaaaacat cccatcttta 13260 

ttctatagct gtaagctcag agagacaatg atcccatgat tttatgggag tagaaaaaca 13320 

ctttattgag tgttttctat gtatcagaca ctgtgtacat tttgctatgt tgtcccgatt 13380 

ttattaaatg cagtcctttt cctgttcagt gtcaccattc ggaggttttt ttcaagacac 13440 

tctggtgctc catgagatct aatctccatt aatgtgacat atatgctgat attatgtaag 13500 

tgtgtgtgca tacgtacact gacacacatc gtttgcattt gtaatttttt tttacccagt 13560 

tgcctcataa aagattaaaa tggaagatta gactccattc agagtttgac aatgcctcaa 13620 

taataaatca gttctcatca tttgaagaac tatgaggtgg aacaaaaact gatttgagat 13680 

tcataccgtt gtatacctac tctttaaaaa aaagatgcaa tttacacatc tgcaaatctt 1374 0 

aggaaatggc ttagagaatt tgcatgattt gggcacatga agccatttta gcacataaat 13800 

tccttccttt catctctaac gggtgtagaa aggatacaaa catatttttt gcagatgatg 13860 

tgaattattg actggcataa atagcaactc acactaaatg cttcagaaat gtgtttttta 13920 

attgagtgaa gatggaataa aaaaatgcag gaaaagtcct gttctatgag agctgaagat 13980 

atggaattag atacagtatt tgtgagtgtg tgtgtatgtg tgtgccatag ctaatattat 14040 

aatttttggt tggtcttcct tgtccattta atgtttgcag catttgttga tggattttta 14100 

ctaaatgatg acaaaataat gaagcagatt agtaagaaaa cctacaaatt gtttactaga 14160 

tcttcactat ctatattaaa tagatgaact aaaaggtaaa tatggatttt tacctactaa 14220 

agaaaactac tagaatttat tgcactttaa aaatatcatg tagtgtgttg taaaatgaac 14280 

ccaatcagat gttctcttaa aggactatat tagggtattg taaatagttt tcctaaaagt 1434 0 

ggattttttt tttttgcacc acaattcaaa taatgccaga attgccagct ataaaccaaa 14400 

tgtgaagtcc atgagcttga aaagcccata gtgttcacgg ggatcagtcc ctttttttac 14460 

actggaaaca gccccagaaa gttccagcca gtaatgaaag atggtgcaca atggctgtga 14520 

cagaaccgtg atcaaattcg gtcattgtga gtcttctggc cagcgtcgat tttctaccac 14580 

atgtgttttg cggcctccat aagatgttaa aggaacactg aatcattggc aatccataat 14 640 

actcagattc ttggcttaga gagattattt accccggggt gcctttctaa cataaacact 14700 

agccttgggt atagtaacag ggagtacata gaatagtcca actcatatga atgcaagaaa 14760- 

ttataatgtg attaattttt ttggtcattt caatttattg aaggtaattc agttggctct 14820 

gtgaatgtat ataggaaatt tagaaatatt aatatgtttt tatttttttc ctctaggtta 14880 

actgaagtga atcaaactat tggattataa ttccatgaag gtaagaaaga gctccagatg 14 940 

gttttcatcc tatcaccagg acctgcacag tatgagtacc tggtgcataa caaactttat 15000 

taattatctt ttctctcttt ctttccccaa ccatgtgtat acatacacac gcacacacac 15060 

atatatatgt acatacacat atatacatgt gtatggatgc ataatcacat acacacgtgt 15120 

atatatacac atatatacat gtgtatgtac atactcacgt acacatatgt gtatatattc 15180 

acatatatac atgtgtatgg gcatatacat atatataata catgtgtatg ggcacattcc 1524 0 

tctatataca cacatacaca cacgtacata caataaataa atggttgaat ggcaaaatga 15300 

ctaaatggat ttagtacaac ctaatgatat gccacagatt tgtcaaagag ccctggaatt 15360 

gagtaaaaat gaatgaacta aattccaaga tcttttacaa cctatgattt cattccagca 15420 

gaaatttgta aatatgtgta ccatagattt atacaaaacg ttaatggtgg .cattatttgg 15480 

aatagctcca aatgggaaaa agcccaaatg tcaatcaaca gtagaatgga tggattttgg 1554 0 

tatatccata gaatggaatt ctagaccaca gttaaaatga gtagattatt gctacaccca 15600 

gcaacatgga tgaaactcaa aactccttga gtggataaag ccaggctgtt tacgattaca 15660 

ttctctgtaa gaacttttat tcaaatttca aatacaagca aaactattct acagtgagac 15720 

aagtcagtgg ttacctttgg cagttagtgt acccaggctg gggcaggagg gaagtttcta 15780 

ggtgatgaag atgttttaat tcctgatcta atcctagctt tactgatggg tttactttga 15840 

aaaattcatc aaggtgacta tgcactttct gtaggtttac atgaaaattt caatgacaca 15900 

cttgattgta tgactttgga tattactttt cttttctgtg ccttaatttc ctcaattgaa 15960 

aagtgaggaa gagaatgctt acctctaagg attgacatca agattagtgg ctgagcacgg 16020 

tggttcacac ctgtaatccc agcattttgg gaggcagagg caggcagatc acaagatcag 16080 

gagtttgaaa ccagcctggg caagatggtg aaaccccgtc tctactaaaa aaatacaaaa 1614 0 

aaaaaaaaaa ttagccaggc atggtggcat gcgcctgtat tcccagctac ttgggaggct 16200 

gagtcatgag aattgcttga acccgggagg cagaggttgc agtgagctga gatcacgcca 16260 

ctacactcca gcctgagcaa cagagcaaga ctccatctaa aaaaaacaaa acaaaacaaa 16320 

aacaaacaaa caaacgaaaa gattagatta gcaagtagag tatgtgtaaa accacctgga 16380 

aaaatgcctg gagtgtgtgt tcagtattta attgtagaaa cagttacctt cataaacaac 16440 

aggacattgt gggaacatct gttaattacc gaacagtctt tttaggaccc cctggggttt 16500 
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gtggatagta 

tcaaattaac 

tgtttttaat 

tatgtataca 

ttattttagg 

gattcattca 

tacactaagt 

gaagtcctaa 

aaaatataca 

tacttaactg 

tttattaatt 

cttaagtact 

agatctctaa 

cattaatatt 

attatatata 

aaacaatgct 

ggaaattggc 

tcagaaaacc 

tatattttca 

gtcccacagg 

acgggagccc 

tttctttgtc 

ttacctatag 

acagtagttt 

atatcaatag 

gttttataac 

gtgtataaaa 

tagttgtgaa 

ccaaaaaaga 

gtcagagcat 

cggctaagat 

attttgtaca 

caatagcata 

tctgactgca 

attgattaaa 

aaaaaaaaga 

aattagtgtc 

tgagaacatc 

tgccctttca 

ctcttccatc 

ggacccagga 

gtaactgcag 

tcacacagct 

caactgttgc 

gccccttcgt 

gtgctgaata 

gaagatggaa 

ttttggcaat 

tatttgaaag 

atcagatggc 

aagtttgatg 

tcaattgaga 

caacagagca 

gctgtaaagg 

atttttaaaa 

acatatgaaa 

tgttttgcct 

aggcaaataa 

tctttgcact 

tatggccatg 

tgccactcat 



atttttatta 

cagcagttaa 

tgacaaataa 

ttgtggaatg 

aactacgttt 

ttcttcattt 

gtcgggatta' 

aagagagaac 

caacaaaaca 

gtttagaaat 

atataattaa 

tttaagtatt 

aacattttta 

gtttacatgt 

aatatattta 

acatatatct 

cagcatttca 

agtgtggaga 

ttttcttttt 

ccactgggta 

atggacccct 

tcacagtcta 

caaaagatgt 

ttagtacttt 

gcattaatgc 

ttactgcggg 

caccttgaaa 

agtttcaaaa 

gggaaataca 

tgtgaatctt 

taaaatgtaa 

tatatagctt 

tgtgagtttg 

cattaggaaa 

ttaaaattta 

ctccccaggc 

aacgtcatca 

tagagagtgc 

tgtagtctgg 

ctcagggttt 

aggtgaaaac 

gttgagggtc 

gcttctagag 

tagctctata 

agacagtacc 

ttctatgcca 

ccaagagaga 

ggaatctgag 

gtcacgaaag 

aggggctggg 

ataggaggaa 

ttgccaaact 

ctaataatat 

cttagatttt 

tgaacaaaga 

cgacagccat 

tataacttca 

agtccatttt 

cttacccttt 

ctctcctgcc 

catcatgact 



ataacacctt 
gagctatgct 
aattgtgtgt 
gctaaattga 
ccagaagtct 
aacccaggaa 
ctaacatcaa 
atagacagac 
tataagataa 
gttttgaaga 
cttttaatta 
cagttcagtc 
tcttgcaaaa 
taacacattt 
taaatataca 
aaggcttcta 
gtgtgaagga 
agatgcataa 
catagtcaca 
atagcgaact 
ccccagctcc 
gaagagtgac 
acatgtaatc 
tataaattca 
cagtttttta 
gtaaatgaga 
taaaagtttg 
tgacagatgc 
gctcaaagag 
cggctttgat 
caatatcagg 
gaaatgcata 
aaatgtaatg 
tttactgctt 
tggggatgtg 
gattataaga 
ttgtgtgggc 
aggttgttcc 
gcaccagagc 
tggtaagcac 
aggttttgtt 
atcctcagtc 
tctgcttagc 
gttctcagcc 
ttcattaaat 
agcctctgcc 
aaatgacctt 
catgtagagt 
gcttcttccc 
gcatcaatgc 
taaagaattt 
cttttaatga 
caccgagact 
tgatagctac 
tataaagaga 
ctctggtaat 
taaagattca 
tggtcagaat 
taaagaaaac 
caggtgttct 
tcagtcctta 



tatatgggct 
tcttcttttt 
atttatcttg 
gttaattaac 
gtaaggtaga 
ttttttgagg 
ggaacagtta 
acataaaaac 
acacagaaat 
aatgtatgct 
tgataaaata 
atgttcacta 
ctgagactct 
cattatatta 
tgaaatttgt 
gtcctatatt 
ggaaggatgc 
taggcttgac 
atgaaagtga 
gtgaagggtg 
tttgctccca 
atttcaaaaa 
acattatgac 
aacttagcaa 
atgccagtga 
tatttagtta 
aagatgtcct 
attttaatgg 
taaagcagat 
cttgagtacc 
ggaaatgcat 
tgggaaatgt 
cggatatata 
aaaatgacta 
cgcctgaatt 
aactgccagc 
tgttgttgga 
tcttcttgtc 
accctgagcc 
catctagagg 
ttcccagctt 
aaaggccaca 
tgcttcaggt 
atcttcagtt 
attcctcatt 
tgctgcagta 
taaaagatgt 
ttaaatgtaa 
gagaggaatt 
aagtgaagaa 
ggatttatta 
aaagtagctg 
attgatgtga 
tcaaggaaga 
gaaaatacag 
caaaagaagg 
gaaaatgggc 
gtcaatttat 
attaattgtt 
ctcccttaag 
aagtttcttt 



gaaatcacaa 
gccaatttac 
cacaacaagg 
atatgcatta 
tggattgata 
gtcagtttca 
ctgacacaac 
atagtataac 
gtttggagca 
aatatgtctt 
cataatataa 
tatttatatt 
acaccctttg 
taatgtatta 
atatttgaag 
ggacttacag 
cgtgttctct 
gcatttttta 
aagcaaagga 
tqqaqtqqac 
cagcaatctt 
ttatagcagc 
atactacaaa 
ttttccttat 
tttttaacct 
tacttttgaa 
atgagttgct 
cagagtagga 
gcagagaaca 
tagctagata 
ttgagaacag 
aggtggagat 
atctaaaact 
aatctcaggc 
ttttttttca 
tttggaaact 
tggatgagaa 
ttctatttgc 
cagtggactg 
aggcatagag 
cctttctgct 
gctgcagcag 
caagggatga 
gattccccta 
ggttaaaaag 
acataagaag 
aacgaggtgc 
gaacggaaaa 
ttagtggtgc 
atgaagggtg 
aaaataagac 
tgctctatct 
gggcaatata 
acacaaagaa 
aatgtatacg 
ctactacaag 
catagccact 
gtatcttttc 
tcactgaagc 
ccttctcttg 
aataccactg 



tgatactgtg 
ttgttttatt 
tgttttaaaa 
cctcaeacag 
tttttaacct 
gagcatgcgc 
aaatcgatgt 
acaatatatt 
atgaattaga 
ttgttcaaaa 
aatttattat 
gatgtggaac 
aacaaccttc 
tcgataacat 
tattagtttc 
tggaaatctg 
cccatctgtt 
taatgatgtt 
aagagctaat 
tcactgggag 
ctcccttctc 
agaggtgttt 
acaggtaaaa 
ttcaacatta 
gctgaaattt 
agtattgctt 
gtgaaaagag 
gttcatttat 
gcaaacaacg 
ttactgtcaa 
taaaacattt 
ttttagtagg 
ggttctcaat 
ccaactgcag 
ttaaaaaaaa 
gctggcctac 
caaggatgga 
ccttcagatc 
caccagtgga 
agaaaatcag 
gggtcaaggg 
gtgcacctct 
tgaggtctcc 
aatcctgcct 
tagtgacctt 
agttgtggct 
ccagggaaag 
atatctgttg 
gctaacaaaa 
gttatttgac 
atggagctaa 
ccagcagttg 
catactcagt 
ttaagtcttg 
aataatcaaa 
caatgatata 
attggcaaga 
tgaagtctgg 
tctgagttca 
agcacttatg 
gactataaaa 



16560 
16620 
16680 
16740 
16800 
16860 
16920 
16980 
17040 
17100 
17160 
17220 
17280 
17340 
17400 
17460 
17520 
17580 
17640 
17700 
17760 
17820 
17880 
17940 
18000 
18060 
18120 
18180 
18240 
18300 
18360 
18420 
18480 
18540 
18 600 
18660 
18720 
18780 
18840 
18900 
18960 
19020 
19080 
19140 
19200 
19260 
19320 
19380 
19440 
19500 
19560 
19620 
19680 
19740 
19800 
19860 
19920 
19980 
20040 
20100 
20160 
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taggactgac gaaaagaccc attcaacatt ccattgaagg attaaatgct atgttttaag 20220 

atataatttt ataacataca tctatcatga aatccattga ctcttcacaa ttagatgctc 20280 

ttacatagaa aacatagtta cttacaaagt catggttgaa atgaatctct gctgtaacgc 20340 

cattacatat ccctaaaaca tatttaagaa ttatgaagag gagcattatt gaaaacaaaa 20400 

tgtattgaca ctaaagagta actattttaa aaaggtaata tcatgatact catatgaata 204 60 

tcaaatgaat gcatgtcaaa tattttattt aactctttaa ctaatgaaga aatgggtatg 20520 

atgctacacc caattcaagg"* tagaattcaa aattcaaaat aaacacatat atacataaat 20580 

caggaatttt gaaatgaatg tacaaaatga tgcaaaactg gctttttcca tggagattgg 2064 0 

ggtcgttgtc caggaatcag tagttatttc cagggacaaa attcatttgc atttctatgg 2Q700 

gcaagtgtca tttaaactgc aatcagttat ctacaattta tggaatcgta accactttaa 20760 

aaaagatgaa aacagttaac ctgaaagttt ctcctgcact aaatgagaaa gaagctcagt 20820 

tatttttaaa tttgcttatt tttgtcttgt tcttttttaa tttcacaata tttgactgac 20880 

agtaacttac ctgatagtga taaatcaatt tgtgccacat ctcatggcat atcagctttc 20940 

tgtgtctttg tccatctcaa acaaccccac atgcctacct caacatgagt tttatctccc 21000 

ttcaaactgg attaatggta tccatgttac atgactttat acgtgttctt cactttacag 21060 

caaatagtta tgaacaatta atggaacaca cacacacacc cagataggaa tgggaccctc 21120 

tgagttttgt tcacagagtc tatactcttt tctattgtgt catataatgg tgacaattca 21180 

attaggtggc ctatgtccac aatccatggc ataaacgccg gtaaatttac agattctccg 21240 

catcctattt ggcaagtgct tcagtgatat ctgtagcctg aagagaaaag ctaagttttt 21300 

cctatttctc tctcttcttt ccttcccttt tcttttgaat ttcctaaaac aaaggaagaa 21360 

aaaggagggg aagaatttgg caatgtgtca gttaatttca tcaatgtaga caaagttaaa 21420 

ttaggaggct gaattaccaa gactggctgt ttccattgat gaaattaaat caggtaaatt 214 8 0 

gccagtgctt cacaaatgct gtctcaggtg tgactgaagt tgtgattgat tggaaagtac 2154 0 

tcacagagaa aacaagtaca ggaaaaacag cactattcca aaatgcagaa caaaaccaca 21600 

catatttagg ctttacagtt ttcgtgctcc ttcatattct tactttgatg tatttctgcc 21660 

ttttaattta gscttttcct aaaaaaaata gtttgcttta cagtttacaa agtaagatat 21720 

gctttaagag gtagaaaaat gaggcaagtg aaaatgaatt aaaaaaaaaa aaagaatcca 21780 

ggagcaaagt cagtccacaa agtttgtctg tgacatcatt gtccttccaa acaaggagct 2184 0 

tcgagttcaa ctccaaacat tagcagtgca gacgggaaaa tgcagtgacc aaaatcagag 21900 

tccctgggtg aaaagcaagc catttaatag gaatgaaaga gtcctgctat aaatactaca 21960 

gaaaataatt ttcaaatcta ctatcctaag gagaacactg tgcaagataa tgaatatcat 22 020 

ctttagaaac atcttcttgt aattatttct taaaatatcc ttccagacaa agcaatgaca 22080 

tcattacaaa ctccaactta gcaaaacgct ttttggaaaa aggaaagtgg aaggacattg 22140 

agtgatgtcc acatatgtag ctctctgatg atttgacttc atccaagaat cgtgtggaaa 22200 

cagctagaat ggggtgaatt ttatatctcc caaacagttt ttcataaata tcattttatt 22260 

tgactaagct ttcaagtgtg actcaatgat agtaaagggg agaacctaat tcctgacaat 22320 

tgcatgagat gttggtcttc agtgttgcca gttaactgag aaactatgtt aataatcagc 22380 

cgcgttagcc cttttctgtt agataatttt attattcgca attctctttg atgggtatta 22440 

ggtttagaca attgaccagc aaactatttt tttttgaaaa tgaccacaaa catgttaaaa 22500 

atggatgaac cctgagagga gacaggaaaa cagcacctaa aaatagtaga aggatattat 22560 

ttctctgcca ttatggactt ataacataat aaacatttag ttggttcact tttttagtat 22 620 

caacactttt attaatfcgct tcttttttta caaatttgat tttattattt tgatacctac 22680 

tgatctctaa tttcttgaga aagtctgtct tactttaata aaatgacata cagtgcattt 22740 

taaacttaaa tactgtatat tcatatttaa atctgagatt attgggacat aatgatataa 22800 

gaatttatat aaggccatcc tctgaatcca tattcaatta aactaaaaaa ggatgtccga 22860 

tttaggaatc ccacagaccc atcagtaatc aggcaagtac ttaacagcca tcctatcatt 22920 

ggtctctcta gtttttattt attcttgtag tattagcaga tgatttctat aatgttgatc 22 980 

aaatcagatt tacgatctaa gttgagttaa atctcagaga atcaaactcc attgtggaat 23040 

ggaagttttt agtttcttat atatggcaat attttigtaag atattttcat aactaacaaa 23100 

gtattctcat tataaattaa gtacatttct cccttaagct ttttaaaaaa aatgatttta 23160 

gaaaattcct gaaataagta tcaaaatgca agattgtaaa atacgttttg ataaatcaga 23220 

aacattctta ttaagagttt tctttattga taagttatac ttctatgttt ataaaatatg 23280 

ttgtattgtt aaatgaagcc aatgaaattg atcaagggtt aggttttaga atatttctgt 23340 

ccagagcttt gctcctccaa attctgtcgt gttagaattt tattttttga ctacaggagg 23400 

atagaagcct aaaacatttt aaaactacat aatccgtttt taaaattaca catgtaaaat 234 60 

ggtgtgcaca aataaatact ctgtcacaca aacatgcaca cagagcataa acacatgtac 23520 

acacattaca tattaatgct tcttctagga cagagattaa taagtagtag agaaaaaaga 23580 

attacagata caatctcaaa gcttgtcata tttctacaag gtaaatatat atctcagtga 2*3640 

tagtgctatc aaaaccatac catcaggtta aagaagtggg taaatattat cagaatttaa 23700 

tgtgaaaaca ggattatatt cctactattg cctggtgaca tagtactcaa cacatttttt 23760 

ttcgtgttat tcatgttaaa tgttaatctg ttaatttaaa ttaaatcaaa ttaaataaaa 23820 
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tctgttaaag aaaattgttc aaatagcttc tccctaaagt tttctttagt tcaaaagatt 23880 

ctgatgctat attttaacct acagcttaca aacaaatttg ggtcttttaa aacttccatt 2394 0 

ccctcaattt tagtaaaaaa . gaaccctact tttataactt acattaaaca ttggaaataa 24000 

tgtataactt acttattagt cttaactttg ggagagaaga ggtttaagta cgtattttaa 24060 

aaaaaacaac agaaatgagt actcatgata aatagtatat gcacctaaaa tatactaaaa 24120 

ttaatgagta caaatactca ttcagctctc aaagtaactc tcccaactga ataaacacgt 24180 

ttaaaaatat tttaatgtat ttatttgtga gttacacaaa tccattagtt taatatttag 242 4 0 

ataatatgac aatgatatat gagacacaat ttattttaag aatatttata taataaataa 24300 

ataaataaat aaataaaaat aacataacct aagtatacct tagaaaaact atggggaaat 24360 

gttaaagata aggcttggga actttgccag cacagtgtgg gtatcatagt gtaggtattc 244 20 

ttatttcaat taaactagtt gctttcagtt taaaaagtag atgattttae atacaagttt 24480 

gcacttctag cttctctttt caaaatgagt tcaggccttc ccatcctgac accttgaaaa 245 40 

agctgtggcg aggccgggcg cggtggctca cgtctgtaat cccagcactt tgggaggctg 24 600 

aggcgggcgg ctcacgaggt caggagatcg agatcatcct ggctaacatg gtgaaacccc 24 660 

gtctctacta aaaatacgaa aaattagccg ggcgcggtgg cgggcacctg tggtcccagc 24720 

tactccggag gctgaggcag gagaatggcg tgaacccggg aggcggagct tgcagtgagc 247 80 

ccagatggcg ccactgccct ccagcctggg gaacagagag agactccgtc tcaaaaaaaa 24840 

ccaaaaaaac aaaaaaaaaa acaaaagctg cagcgaggcc gtttctgaag atggcgggta 24900 

ttgggcactc tttacttcct tgactctact cctgcactct gagactgacc gtataacatt 24 9 60 

tccgattttc tttgatttgt gctattgttt ttcttatagt taagggaaat gtgaaatatt 25020 

attgtacctt. tgtctgtcaa aatttagaaa aatgaaaaat agaccaaagg tgggagtata 25080 

attcattgca aaaacttgat atcctaagct agtgtgtcaa aagtgtacaa tttcgagcat .25140 

tattatgaaa taaatttagt aagatctatc accaatatat aagaaatagg atacaaagtt 25200 

tfcatgaacta aaaagaaaat tctcattgaa ttgtttttta atcaaattaa atgaagctac 252 60 

atcgaaatat ttaaaaacat gttggcaatg atgagtctat aaagcaattt agtgggtata 25320 

accctcccaa tttgctgata tgggccctgc ctggttttgg gggccttgaa tctgcaatcc .25380 

ctggtggagg acaacacccc tgtttctccg gacctgagag tttttctttt aatctaattt 254 40 

ccttcacatt ttaatcatta gcttatcttc aaactttttt aagaatgatc ttaacaaatt 25500 

aaaagtactt tacaatgtat tgcatttact atgtttgaat aaaacattat tttgccacta 25560 

tcccaaagga ttgttatatt tttctatttt agtaagttta ttgaagacaa acaggccttg 25620 

ggccgctgcc tctgtctctc ctgccttccc cactgaatct ccagcccaca ttcacaaaca 256B0 

aagacaaacc tgtgtaaact gaagcaccac aaaactactt ttatgaaagc agaatggaag 25740 

gtaagtgtca taatctgttc ccccttgtac tgtagcataa actttatcga actcactgca 25800 

gcaaatacct ttgtctttca ccaagaaatg gaaattgttt ggtgtaattg tgccttacct 258 60 

tcccgaggta tatttctgga tttgatttac tgaattcaaa gttgaatgat tcaccaaact 25920 

tcttggtttt caatgcttaa agtcccaaat aacttttctt aaaatcagag ctgccaaagt 25980 

taggaatgat tttaatatca attcccgaga ggtgtagaag tagctactgt gctatttctc 2604 0 

acactttatc aaccaaagtc accttcacaa gcatcctcag gatgaagact tcaataactc 26100 

ttaggctgtc catttctcta atgcacattt gtggttgtca gaaaattttt tttatattga 26160 

tttgaatcaa ttgatccatg ttttgcccac cagaacaaca gaaaccaaat gtattctttt 26220 

tgtagtactt ttcattttta aaaacaatta ctagaatccc ctgagtcttc ttttgttctg 26280 

tggtccttcc cccaggacag tgtgtccagg tggactgccc ttccaaccct ctcctctctt 26340 

tcagacaact ttgtccatgt ccctttgcag aggccctggg cagagattcc cgccgggagg 264 00 

tctgctcaga ccagtgcacg ctgaagctaa tgagtcattt tatatggact tggtgatgat 264 60 

ctcagtgtca taaaaaggag attccattat ttctctttca aatacagccc tagctgctga 26520 

ggcagatctc tctgattctg tattttaaat tgatttttga atttaaattg agagatttca 26580 

ggtcttatac ttttagagcc aaaagagaac gttcacatca actggagatt agggttaatg 26640 

ttccagatca acggccaaat agttacgtgg gcttgaacaa ctgtcccagt tggcctcctg 26700 

ggacagctgt ttgctgtaat atgggaatag gaatgtcatc tccccctgcc ttacaggact 267 60 

cttgtgagga tctactaaag aatacccata aaagtctttt aaaaaggtta gagaacagaa 26820 

gatgtcattg gtttatacgt gtatcctgat tccactcctt ccatatcatt gtctcttcgt 26880 

gaaaaccaga cttctctaaa agctcttgta ttattatttt ttttttttcc aaacaccaac 26940 

caatactctg acaccatctg tgtggacagc agttcaattg gattctaact aaccagagtt 27000 

agagaaaacc ccacgggatg agggcccaat gccacaagat tgcccccata tcacatgcca 27060 

gccacaaaaa gagtgccagc cttcctggtc tcctgccagg cccactacaa acacagggtt 27120 

cccaccattc ccacccctcc aaatgtgaca actttttaga acaactcata gaactcagga 27180 

gaacactaca cttactgatt ttttnnggtt tnnattatga aggacagttc aggaccagtn 2724 0 

caaatggaag tggtgcacag gacgggctgg gtatggggta ggggaatgac tgggagcgtc 27 300 

ctcaccctct ctgggggcac caccctccca gcacatccat gtgtttngcc agcttggaag 27360 

ctccttgaat ctcttttngc tcacgagttt ttataaccca atctctaata ccctcctttc 27 420 

cctggaggtt agggggtata gctgaagatt tccaacgcct aatcaccatt tggtctttgt 27 480 
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tgtgaccagc 

taaacacagg 

ctagtcccga 

ttttattaca 

tgacaccact 

ctcgtcctac 

tcttgaagca 

gcttctgact 

gcaactggtt 

tatttatatt 

agacaacact 

gaatgtcagc 

ccagtcagaa 

agatcaaata 

gtgttgttct 

actttggagg 

ttgggcctcc 

tggtattact 

tfaaaataaca 

aaatcctgct 

ccatttcctg 

tcaaacaggc 

ttaaactaac 

aagcaactac 

attggtcttc 

ataaagaact 

tctgcaggct 

aggctaaggg 

aagggggaag 

caagaataac 

cccgacacat 

taacatattc 

catttcctac 

gacctttgtt 

tggggaggct 

atagtcagac 

tgtagaatca 

ttttcaacta 

ttctggggag 

agtctatgaa 

tatcttttaa 

cttatgtagt 

ctcagtggct 

aggtcaggag 

caaaaattag 

caggagaatt 

actccagcct 

aaataaataa 

ctgagagttg 

aagtataaaa 

ttactgaaga 

aactacctgt 

tgtcagaatg 

gtttggcaat 

ttctggacct 

ataaatgaaa 

atttggagat 

actgtgatca 

ggagcatgct 

tcaaggagag 

cccctagaac 



cctatcctga 

tgtagtagaa 

gggttttggt 

ccactgctgt 

ctccgctgcc 

tttccctttc 

cctctcacttr 

cttcctgctt 

taaatgataa 

gctaagaaat 

agtcaggcct 

ttgagctatt 

gcagtctact 

aggcaactgt 

ataaaagcct 

tgcccaattc 

atttactttc 

gtcttttttt 

taagaattaa 

aaaataattt 

acagtatctc 

caaggatgcc 

catatgggca 

taagaccaga 

taaaaatccc 

acctgagact 

taacaggaag 

gaagcaagca 

tgccacacat 

aagggagaag 

ggggattaca 

taagacgagg 

attcaactga 

actgcctttg 

ttgaggcgtt 

ttgattaaat 

aattgtccaa 

tctatacagc 

tcagtgagaa 

attaagggta 

aactaaaaac 

tcattcctgt 

cacacctgta 

ttcaagacca 

ccaggcatgg 

gcttgaacct 

gggcaacaga 

ataaataaat 

gttaagtaat 

agttaaagtt 

caaactgtgg 

agatgacaaa 

gagaagaggg 

gtttctcctg 

tcccatgaat 

ttgtgtcctt 

aagggctttg 

taagaacccc 

tacaagaaaa 

agaccttaca 

tgtgagaaaa 



agctatacag 
aggggcttat 
agttctgtgc 
ccatactggt 
ctgtcacttc 
atgagcacaa 
tggagacttt 
tgtctccttg 
tttgatatac 
gtgctaaaaa 
cataagtgac 
tcttgtaaat 
cacttataat 
ataactgaaa 
cctctttctg 
atgaattgct 
agataacagt 
tttttttttt 
cccaaaatgt 
ccagaatttc 
cactagtaat 
cataaggttt 
tctttgagtt 
gaataaactt 
tatgattttc 
gagtaattta 
catgacttag 
catcttacca 
tttcaaacca 
tctgcctcca 
gttcgagatg 
aaactgacca 
atcctttaaa 
atcctgagac 
tactccacac 
gagaattatt 
ttatatccta 
agtttcataa 
tcagatacat 
agatttgctt 
aacattgatg 
tctcaggaac 
atcccagcac 
gcctgaccaa 
tggcaggtgc 
gggaggcgga 
gtgagactcc 
acagtgtcct 
atcaccttag 
gctggtacaa 
tgtgaatctt 
gactgaatgg 
gaattctcta 
tgctgggaat 
tatataattt 
gaagttcata 

tggaggtaat 

aatatgactg 
ggcaatgtga 
agaaacctac 
taggtttatg 



agggccgcac 
tatgaataac 
caagaacagg 
taccagcatt 
cctgaacagt 
gaagtagtca 
aataccacaa 
gagatatatt 
agatatatgc 
gtggagacaa 
ctcaaccttg 
gcctgtattc 
cctataactg 
ccaaccaaat 
tttcctcagt 
tgctcaaata 
aattatttgc 
tacaatttaa 
gcattcctat 
actgaaagga 
aaaaataata 
gacctgatat 
tattctgtag 
atgactagga 
ttttctgtat 
tgaagaatag 
gggtctcagg 
tggcagagca 
ccagatctct 
tgattcaatc 
agatttgggt 
caagaaactt 
ttgactgagg 
cttctgagcc 
ataatgtcaa 
ttcaaatatg 
ataaaaagga 
aacattagga 
ttcttaaatg 
caaatcttaa 
gtattctaca 
cctgctttta 
tttgggaggc 
tatggtgaaa 
ctgtagtccc 
ggttgcagtg 
atctcaaatt 
ggaaagcctg 
caggtgatat 
acatttccac 
acatcagtat 
ctgtggccaa 
tggggtgtag 
tatcatgtac 
ctaaaatatt 
catcaaattc 
gaagtttgaa 
gtgtccttat 
ggacatagtg 
cttgccagca 
ttgttcaagc 



cctaggttac 
agaaggcact 
ggaaaaagat 
ttctttggtc 
tctcttggtg 
agcggtgaac 
cctgcttgtt 
agtgagagaa 
attctccaag 
ggcctgaaga 
tttgatttgc 
aaaacagaac 

gg^gg^cttt 

gctttctctg 
aaagctcctg 
aactattctt 
atgcctatgg 
gttctaactc 
caagtcagat 
cacctagtca 
ttgaaagaaa 
ttacatagtt 
ttcttaatca 
gttttcaaaa 
tagttcattt 
aggtttaatt 
aaacttacaa 
ggagacagag 
tgagaactca 
acctcccacc 
ggggacacag 
tctccagcag 
cacatcctgt 
aggtacttgg 
acgtgtttta 
atactaagta 
agacagattc 
aatttagtaa 
tttcatttca 
gaagaaagca 
atacttgcat 
agaggaaggt 
cgaggcgggc 
ccccatctct 
agctactcgg 
agccaagatc 
aagtaagtaa 
actcggtaga 
tacataagag 
aaggctctaa 
ccaacatcag 
tttattataa 
taggcaacaa 
agccagagta 
tatattaata 
ctaaccacga 
tgagatcgta 
aagaagaggg 
agaaggtggc 
ccgtggtctt 
cacccagaca 



ctcagtggca 
cccatttctg 
gaagtacaat 
tctcctcaaa 
attttctctc 
cctttatcgt 
ttocttccat 
gtaaaatgtt 
gcacagaatg 
atccctgagc 
aaacataagt 
ttaagcttaa 
ccgaaggnat 
atttacttct 
aaccagttct 
ttaaaattta 
aaggaaacat 
tacagaattg 
aatcacccag 
tttcttttgc 
attcttaatc 
gcagaagaat 
atgaattatt 
gaaatctttg 
tcacactgct 
gactcacagt 
tcatgccaga 
cgagtgaaca 
ctcactatca 
aggccccttt 
agccaaacca 
atttcacctg 
accaaggaga 
gtgcttttcc 
taatagtggg 
aggtcttgag 
ttagtggacc 
accttattac 
tcaagagcag 
ctttgatata 
tttcctctca 
cttggccagg 
agatcacctg 
actaaaaata 
gaggctgaga 
acaccactgc 
ataaataaat 
tggttaccct 
accattctgg 
gagccctctt 
agtaacgaga 
taaccagtga 
ttccataaga 
ctggcaaatc 
atttatatta 
ataggactct 
agggcagggt 
agagacacaa 
catctgcaaa 
ggatttccag 
gtgttatttt 



27540 
27600 
27660 
27720 
27780 
27840 
27900 
27960 
28020 
28080 
28140 
28200 
28260 
28320 
28380 
28440 
28500 
28560 
28620 
28680 
28740 
28800 
28860 
28920 
28980 
29040 
29100 
29160 
29220 
29280 
2934 0 
29400 
29460 
29520 
29580 
29640 
29700 
29760 
29820 
29880 
29940 
30000 
30060 
30120 
30180 
30240 
30300 
30360 
30420 
30480 
30540 
30600 
30660 
30720 
30780 
30840 
30900 
30960 
31020 
31080 
31140 
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attatggcag 

tagctgagga 

ggtaatttaa 

cttaaggtga 

ttttcttaat 

ttgggcactt 

caaaatgaca 

tagctctttc 

acacagaaaa 

ttacacaaaa 

aagggaacaa 

tgttttagat 

ctatactctt 

ttcctcgaaa 

gtctcaatca 

aattaagaac 

tggtggccgt 

tgatatagat 

gtatgtaaac 

agttactttg 

taagtaacct 

tattgacaga 

cacattgtgc 

tgcagtttga 

attatattta 

ttatataatt 

taaaggtttt 

aaaaatgtat 

aatgtacaac 

tagtgtatgt 

atatctatcc 

tctagtgtat 

tatatgtaca 

gaacatttat 

ttagagaact 

atgcaaaaat 

ttctcaattg 

tacgtcatct 

aaagccaaat 

agcaacaaaa 

gttaaatatc 

taagcaattc 

actagacatt 

taaaatcaca 

cagtttaacg 

ttttcaacca 

atctggatgt 

tttccaaatc 

tgttgaatta 

ttcacctgcc 

tacctgtatc 

ttcatctgga 

actacctcaa 

tatgataatg 

aaacaaaaca 

caccattttt 

caaaccttat 

ctctcaggat 

agttacccta 

ctcagccttc 

gtataattta 



ccaaagagac 

tctttcccaa 

gaaatatagt 

aagaataaat 

ttaggataca 

atttaggata 

tcaaaatattr 

caaagctaga 

attattcaag 

gattaaaaaa 

gcacatcaaa 

taatatgtta 

tcatattctg 

gacaaaaata 

cccatttgat 

ggatgtcata 

gtcatccatc 

gtatcccaag 

ttaaatttat 

ttttctaaga 

aaggaagtta 

aaaggtttat 

agaaataatg 

ctcttatcat 

gtgtggcttt 

tgcctaataa 

ataagaacat 

atgttgggtt 

ttgatgttgt 

atctattaca 

acttagcaaa 

ttaaagtatt 

tttatcttta 

atccactaac 

cacagcctga 

ccaatggctg 

atttgagctc 

tttattcaac 

tcctaattgc 

caaaatatat 

tattgccatt 

aggaggtcca 

ttggcagata 

attctcatgg 

aaagaaccag 

agaactcccc 

tttaaaggcc 

tctttccttc 

tttacaccag 

attacatgtt 

tgcttctaac 

ccatatcact 

gcatcttctt 

tattctacta 

ctatgaagcc 

gtgatagaga 

aacacttcat 

agagacccaa 

acttcctctc 

cttaatttct 

tttatttatt 



taatacaata 

tttaataatg 

gcacatcaca 

ttttatgaaa 

atcttagaaa 

agatcatgga 

ttaaagtaaa 

aaatgtttta 

tgggatacag 

tgacttttca 

aacagagtga 

ttaacaaaga 

gaaagatctg 

tacacacttg 

taagcttatt 

cagtatttac 

ctttacatct 

agatagtctt 

atttagtagt 

attctccaat 

gaaactaagt 

cataataata 

ttagttttta 

catgagaaaa 

atcagaaaaa 

tggaattaga 

ttttaatgtt 

gatgcaaaac 

gacatacata 

taccatggtt 

ttttaagtat 

agtttatttt 

aaagccaatt 

tgaggtcaag 

tatcttccac 

agatctcaaa 

acaagtaaaa 

agacagtaga 

ttcctccagt 

agttagtagg 

tgataccctc 

tttcttcagc 

atctacctct 

aaatatcaaa 

taagatgttg 

actgacacca 

tctcaattca 

tcttggggtg 

aaatctaaga 

tattaccaaa 

ttaaccacgt 

tagactaaac 

atccctccag 

ttagaaatcc 

ccattgatct 

ctcaaacact 

ctctcttagg 

gccctctatc 

ttcaatattg 

tcataggttt 

tatttttact 



atatttttct 

cttgctatgc 

aaacataatt 

tctcaaaaga 

gccaaaatct 

tacctaaaaa 

cttagaagtt 

tttttagtaa 

aatctctatc 

cattgtaggc 

aatttgccaa 

caaaattcaa 

tcacttttct 

tgtttccctg 

tcatagaaaa 

tgtattacca 

gcttgccatg 

ctggacagca 

caaggactca 

tttaagggta 

taaagctgtc 

atttaattta 

tgatcgatag 

gcaaccaaaa 

acatatagac 

taaacttgga 

ttccaacttt 

agcaattact 

tacactgtga 

atctttgtgt 

acaatacact 

tctttttttt 

agaatagtac 

acctttcaca 

ttcagccacg 

gagaagttca 

caaaatagac 

cctacagaga 

ggaaaataag 

aaacaaagtt 

tctgggaggc 

cccacagttt 

aaacccatta 

taaacatatg 

aaatcagttc 

accctatcaa 

ctgtgtccaa 

ttcactactt 

ctaataaatg 

ctctgttgat 

tcatccatac 

cattagccat 

ttttgtccat 

gtaatgattt 

taggataggg 

gtgatcctcc 

ataaagacca 

tggagtacaa 

tcatgaacca 

ctcaacttct 

ttcaatattt 



gtttaaattt 

agctctcacc 

atttgtagca 

gatgtagaca 

aaataattgt 

acagtggttg 

tccacaatta 

tcaaagatat 

acctattatc 

aagtcacgca 

aatataaaat 

tttctaagtt 

aaaatacaaa 

actctattat 

aactgggtta 

aaggtcacag 

gcaaaaataa 

tataaaaata 

actgttatac 

tttaacatta 

acactgcaac 

atgcaatttt 

accactataa 

tagagtgaat 

tttttaataa 

atttaaaaat 

attggtttat 

tttacacaaa 

cgtgacaatc 

gtttgtggtg 

attaaagatg 

gaatttggaa 

ctctttaatt 

attacagagt 

agtcaagaat 

gtttcccaat 

aataagaata 

tcatcgaatt 

ccatcagcta 

aaagaaacag 

aggaaaattg 

acctgactct 

acacataaca 

tcaaatgtta 

tcagaaattt 

aaattgccat 

aaaggaattc 

ctggcaatga 

atactttatc 

tttactttct 

catgacacca 

atgtttttct 

atatcagcca 

cctattgctc 

acttaaactt 

ctcctcttat 

aaatctatga 

ggctctctta 

ggattcccct 

catcactgca 

ttctcttaaa 



tatctagcaa 

atagtgtcga 

tcactctttt 

tttgattcat 

tggtggagat 

gttaattcat 

taaatattct 

atgaaagtaa 

agaattttga 

tcattaaacc 

attttagttt 

ttgtttttca 

actaattctt 

ccttagtaga 

atgggggaat 

gaaacacacc 

atatactcat 

acaagcacaa 

attcattaga 

gtcaaggtct 

acatagttac 

tcatatttta 

attcagaaat 

tgtaggtcaa 

atgaaattat 

gcttctgaac 

aattaacata 

ccaaatattt 

acaaacaact 

aaaactctta 

atatttttaa 

aaattggatt 

aaagagatag 

cacaaacgtg 

acacacaaag 

gtgcataaaa 

ataagccgat 

ggcagatcat 

tgtcttactc 

agtgaacaag 

ttcatgggct 

aagttgatga 

ttcccacagg 

tttaactcat 

gttggttttt 

ttgacatttc 

atgactttct 

tcccactttg 

tcactacact 

aaatatctct 

ccatcaccat 

cattggtttc 

gagtgataga 

ttgggataaa 

tcttattctc 

aataaagacc 

taccctgctg 

caatgctgct 

cactttctgg 

tcttagaagt 

tctcctactt 



31200 

31260 

31320 

31380 

31440 

31500 

31560 

31620 

31680 

31740 

31800 

31860 

31920 

31980 

32040 

32100 

32160 

32220 

32280 

32340 

32400 

32460 

32520 

32580 

32640 

32700 

32760 

32820 

32880 

32940 

33000 

33060 

33120 

33180 

33240 

33300 

33360 

33420 

33480 

33540 

33600 

33660 

33720 

33780 

33840 

33900 

33960 

34020 

34080 

34140 

34200 

34260 

34320 

34380 

34440 

34500 

34560 

34620 

34680 

34740 

34800 
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tgctgcctgg ttcaaaccta tgattatcca agtttatctc atttttattt tgtcagggcc 34860 

ttttccttga ctgttcaggc aagaccaagt tatacccctc tcctgatagc acttaaaaaa 34920 

cattgtaatc ttggccaggt gtggtggctc acgcctgtaa tcccagcact ttgggagacc 34980 

gaggcaggtg tatcacgagg tcaggagttc cagaccagcc tggccaatgt ggtgaaaccc 35040 

cgcctttact aaaactatga aaattagcca ggcacggtgg ctcatgcctg taatcctagc 35100 

tactcgggtg gctgagacag aagaattgct tgaatctggg aggcagaggt tgcagtgagt 35160 

tgagctgaga tcacatcatt- gcactccagc ctgggcgaca gagcgtgact ccgtctcaaa 35220 

aaaaaaaaaa aaaaaattgt aatcttacat tgaattttgt ggttcagttt ttttctctac 35280 

cactgcacag ttaatttcaa gagggcaggg ccatgtctaa ttctacttat tatattccta 35340 

ctgaatatta tgtagttgac atatattaaa taattttaat ttaaataaaa tgtatttatt 35400 

gagtgaatgt aggacttgct gtcatataac atatttctaa gtctgattct atcttctact 354 60 

atactagata attagttcca aattttatgt atatacgtta taatgtgatt agactcccat 35520 

aaaaaagtat taaataataa agagagttaa tgtgtcatat cacaagaagg ccagaagtag 35580 

gaaagttgta gtagtttagc ctctgaatca gatggggtta taaattaact tcttttactc 35640 

accagctgtg tgagcacagt tatatattta cagatcactg aaacattttt ttgaaattca 35700 

gtctgctcaa gaaaaaaaac aggaatgaga atattacctt tcacacaggg attctttcaa 357 60 

aattactgaa ttaatgcatt caaagaattc tttcagtact gttcaagcac ttcaatatta 35820 

ccaatgacag tttcctgcag tatcttggat attattcacg tggacctata gagtgttagc 35880 

catttaaaag tcatcagctg cttttgctgt ctccatgttt attgcagcat ttaggactag 35940 

gttcaactac tcataacaga aaatgttact gtagtggctt aaacacaaat aatttatctt 36000 

gtgcaaaaca ttcaggggtg agcatttggg gagaataaca ccaagaagcc agaccctttg 36060 

taacttcctg cagtggtttt agcaccttca ggcgggaaag tcaaagaaga gaggagtgac 36120 

attgggagac tttgatgtac gtatcactgc ccagaagcaa tcatttctaa cagtgaaaaa 36180 

ggaggggaaa tgaaggtttt tgcctccaca atctctatag tagaagcaga caaggcaata 36240 

ggggaaggga aaaaagtttg gtccatcata tatcctaggt gtaggtattt gagggtgatg 36300 

accataccag ttaattaggg ggagatgagt agatcctcat ctcttccaag tataggaagc 36360 

taactttgcc gtaattgtta ctaatacacc cttcaaaatc accattttga aaggttttgt 364 20 

gaaatgagca ttattttctt ttaatacaag gtcagacctt cagaatgatg aataaactca 364 80 

tttgagggca agtaccaatt cagagctgct gtaaagtagg cattttcaag aagaaggaat 36540 

acttctcatg tttgaactgt gtaagcaagt ctttatttat tttttgaatc ctgaaacagc 36600 

aattagaaaa gagagaggaa gatatttcct tttctgaagg ctgccaagaa taaatcctat 36660 

tctcatttgt taaaataaaa agaggtaata agctaaagta agaatacact aaccttattc 36720 

ttgaacccag ccgcctctag aaaaaaacac tataaaagtg aagctaagtt cagtaatttt 367 80 

tgcagtcatt gcctttttgt tagctctagt tatctttcaa gttttattga taataaaaca 36840 

ttaattgcaa ggacaacttt tgaaatctat tattgaatac ctaggacact .catgatgtct 36900 

ctccacattc ttgccaagga attgctttat gaattaaacc acttggggaa ttaaaccact 36960 

atcaattaaa catacagttg tataaggata gacatatact actttacatg caaaaattta 37020 

ggccatagca atttagtaac taccacgtta cagggactgg aacttttcac tcctccactt 37080 

cttctatctc accactctgt tatatgaagc tatgattctc cattcagaat ataactacat 37140 

tagtttttga aataggaaaa tatgaaggac agactctaga aatatcagaa tcacttggac 37200 

aacaaagtcc aaaatatgta ttatgaaagt gaattatcat gtttgactca atatgttgct 37260 

gcaaataaat aaaaattatt tgctcaatat agatactcct tatatgcctc aatattttat 37320 

atacactgac tgaatattta tgactatgta tgagcagaat aatgagaaaa aaacttacta 37380 

aaaaatctaa ttattttgtt ccttttctta ataaatgtcc cgtttcacag ttataacact 37440 

ttctggattt ttatttttat tctgacttta taaaataaac attattttta gaatagtttt 37500 

aaattaacaa aaaaacacac agagattaca gaatgctcca tatatgttat acatatatat 37560 

atatattata tatacacaca cacacatata tttaaatacg tatattatat atatatttta 37 620 

tatatcccgt cccctcccct agaaatcact gaattgttta ctggctctaa agttttgctc 37 680 

tcattattaa atatgtaaca aatatatatt atatatttat atacaatgta tataaaataa 37740 

atatatatac acatgtaaaa aaatatatat atattttgcc tgctccagaa ttccattcca 37800 

tccaggatac cacagaactt tcactaatct tgtctctttg ggtgcttctt ggctataaca 37 8 60 

gtttcttagg attttcttat ttttgatgac .cttcacagtt ttgaggagtg ctggtcagat 37920 

attttgtaaa atgttccttt cttgtaattt ttctgatttt ttaccatgat tagactggac 37 980 

ttagggcttg tggggatgga ctgtcacaga ggtaaaatac attctcatca cagcatatca 38 040 

agggtttata ctattgacat gacttaaaac tattgatgtt aacactgatc acctggttga 38100 

ggtagtattt gtcaggattc tccactgtaa agttaatttt ttccctgttt ctatattgta 38160 

ttttagctat atcaataaga actcatggac atttctcttg tactttgggt tataatccaa 38220 

tcttttttat ttttgctcaa .attgttccag ctttggtcat cggacacttt aagtagctgc 38280 

tgtgtgcttt tgatgttatt gcagcttggt ctgtttagca ctttcttact ttctaaaatg 38340 

tcaagatgct ccaggcttat cttgtatatt accttatgta gctccaggat taaccatttc 384 00 

tccaaggttt tgacaaactc atagtgtcat gtattcacca ttacagtatc atacagaata 384 60 
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attttacagc 

ttactgattg 

catgggtatg 

atccatgtct 

atggatatac 

atttgggaca . 

ggatataaat 

ataatgctac 

attttgagtt 

ttattttcat 

tataagtaat 

ttttattctt 

cacatctatt 

tcctgggtca 

acatacattt 

tttatttatt 

aaaactattc 

atttctgtac 

tagttcccat 

gtgaggtttt 

tataaggggc 

atgccttttt 

ttgaacatct 

accgatgata 

atatcatgct 

tcttttgact 

atttgtatga 

tttttgtggt 

tatgtcttct 

catcaatgtt 

tgagtatatc 

caaatctttg 

gtaaccttgc 

tgaggtggtg 

tttcactttg 

ataggagata 

tttgctattc 

gtagaatcca 

gtacctttgt 

taagtaattt 

caccaggaaa 

cacacagcac. 

actaagactc 

ataaggagga 

gcctgtggta 

cattttcagg 

acatggtatg 

taaaacctgc 

tttgctgcct 

gggggacatg 

agtttaataa 

attaaacata 

acctttgata 

ataatcatct 

tttaacaatt 

tagagtgatt 

tgctatata.a 

ggatttcgtt 

gtcatgctgg 

cctcccatta 

gcagcaatat 



ccgagaacag 

cttactggct 

tagcttttac 

ttgtgtatgt 

catggtttat 

attataaata 

gttcaaataa" 

atttttcctt 

ctgatcagca 

tatgttgttt 

ttattagata 

tgaacaattt 

tgtttaattg 

gacagatttt 

gtgatccatt 

tatttatttt 

tctttttctt 

aataatcata 

gattcccacg 

ccctatactg 

acctctttca 

ccttccacca 

ttttctttgt 

tctgtctttt 

atcttgatta 

ttcttcttat 

aatttagaat 

tacattgaat 

aatgcataaa 

ttatagtttt 

atttattgga 

tctttcatgt 

tgtattcact 

ctttatccta 

tcctgctttt 

aaactgaaag 

atatgtgttt 

cagataaaca 

atgtacccca 

tacacctgct 

aacttcctag 

agtagagaat 

aggcacgcac 

aaaaaaccag 

tttctcttcc 

caaagcttta 

ggactgtaaa 

aagccatgcg 

ttgcttccat 

aataagtgtg 

ttgatttcta 

gctataatta 

atattcatgt 

ttttttggac 

atatagcaat 

ggactatcgc 

ataaagtatg 

gcacgtgaga 

ggatgtgagc 

ctgagagtgc 

aatgaactgg 



ccctgtgatt 

ctaaagtttt 

agactgactt 

tgatagccca 

ttgttcattc 

aagttgccac 

gttagttaaa 

gttaagaaac 

gtggatgaaa 

tctttttggt 

tgtgttttgc 

attttataaa 

attgtgctat 

ctattatgtt 

ttgatttaat 

tgtatgtgga 

tcaacctttt 

tgatgtggct 

tgttgtggga 

tcctgtggta 

ctcggttccc 

ggattgtggg 

agattgccca 

ctgttctgtt 

ctgtagcttt 

tcttttttac 

cagtttgttt 

ctacagatta 

catgaaatat 

caacatacaa 

tgctattgta 

atgagaaagc 

tattattcca 

taacttctat 

ccttgtaaga 

tccacacatt 

tttaaatgtt 

caattagcaa 

caaaaaatag 

ttctcttttt 

caacctctga 

tgaaaaattt 

atacacatac 

cattacaaga 

atttgaccag 

ggaagcacat 

aatgacttat 

ttgttctctt 

ttttggactt 

tgtgcacatg 

taattattaa 

ttaaatatta 

gtgcaaaata 

aaacaaagaa 

acagtggcta 

aggtataaac 

ttctaaaaca 

atttctctgc 

cctgaagcag 

tgattacaaa 

tttatagttt 



cacccagtca 

gctctttaca 

ccttcattta 

tttttaaatc 

acctattgaa 

atgtataaat 

cacctggtag 

tgacaaattg 

gttcccatta 

gagttttaaa 

aaatatctcc 

acataagttt 

tggtgttatg 

tttgtctaga 

tttgtgtaag 

tgttcaatta 

tgaccctttt 

gtgtcccaac 

gggaccctgt 

gtgaataagt 

attttctctt 

cctccccagc 

gtttctatca 

ccattgatct 

gcagtaaatc 

tactgtattt 

ccatctacaa 

ggttgaagga 

tgctccattt 

atcctgtgca 

aattataagg 

catggacttt 

cctaagcgga 

tctctgatgc 

acaggaataa 

ctgattttaa 

tatgcttgag 

atgtaagcta 

atgtcaacaa 

tttaatgaaa 

tgtattcctg 

gagaaaaaga 

caaagtgatt 

gtgagacagt 

aaatggcaaa 

gatttccaca 

tccaagatga 

attgaagttt 

gagagagaag 

cgcacctgtg 

atatagcttt 

attattttat 

tattcttctg 

aacatataga 

actatagcta 

cctagcaggg 

cttgcatgat 

ctgttcttcc 

tgacaggaaa 

gctacaatta 

ttcaagaaga 



acctttccct 
gaatatcaca 
gtaatatgga 
actgaataat 
aggcatcttg 
atgcatgctg 
tctgattgct 
tcttctaaag 
ctccatttca 
atggctttgt 
ttttagtctg 
ttaattttga 
tttaaaaact 
aatttcatca 
gtataaggtc 
atccagaatc 
gtcaaagatt 
ccaaatctca 
gaaagataat 
ctcatgagat 
ttgctgccgc 
cacgtggaac 
ggagcacaaa 
aagtgtttat 
atgaaggcta 
gtttttccag 
aatagcttgc 
aattgacatc 
attgagatct 
tattttgtta 
ttttttttac 
ttatattgcc 
tctcagctgc 
atccaataaa 
tgaagtccaa 
aaaatcttct 
gtttgaattg 
taattcttac 
tttggaggct 
cccagttgca 
catctagagt 
gtcagttgtt 
agcaccttag 
ggcaggttac 
ttctgttgaa 
tagaatctgg 
gaacacatgg 
tgttaaagct 
aggcatcaag 
tggatatgtc 
actaattgat 
atgtttagaa 
tcctggtaac 
caaacaggat 
ctctaattaa 
ataaaatctt 
ttattccacg 
tattaaccac 
tatctgcaaa 
catgtggatt 
aagcatctgt 



cccttagaaa 
taaatgaact 
tttaagattc 
atttcatcac 
attgtttcca 
ctttttgtgt 
ggattatatg 
cagttgtgcc 
ttgtcaacaa 
atattttata 
tggattgtct 
taaagtccaa 
catcaccaaa 
tttgtatttt 
tttttctaag 
atttgttgaa 
aaatgatggt 
tcttgaattg 
tgaatcatgg 
ctgatggttt 
catgtgggac 
tgtgagtcca 
ataaactaat 
tgattggcta 
ataggacaag 
gtcttgtgcc 
cacgattttc 
tcaacaatgt 
ttgatttttt 
gatttatatt 
attttaattc 
ctgatatcct 
atctgctgga 
attcattgat 
gctttcacac 
attcttttgt 
tttatgtaat 
agttttaaag 
actttttatt 
gttctttgaa 
agaaacacag 
ggccacattg 
ttgagctcaa 
tactagccct 
aagaatggcc 
cctttgtcag 
ggtagagctt 
gttccagaca 
acgtctgcag 
aaaataataa 
agctataatt 
aaaatcctat 
tgctctgttc 
taatataaaa 
ttacctgttc 
tttatttata 
aatactgcat 
taaatgttat 
atggcaaatg 
attcatgttt 
agtatagtaa 



38520 

38580 

38640 

38700 

38760 

38820 

38880 

38940 

39000 

39060 

39120 

39180 

39240 

39300 

39360 

39420 

39480 

39540 

39600 

39660 

39720 

39780 

39840 

39900 

39960 

40020 

40080 

40140 

40200 

40260 

40320 

40380 . 

40440 

40500 ■ 

40560 . 

40620 

40680 

40740 

40800 

40860 

40920 

40980 

41040 

41100 

41160 

41220 

41280 

41340 

41400 

41460 

41520 

41580 

41640 

41700 

41760 

41820 

41880 

41940 

42000 

42060 

42120 
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aaattactct 
gctgtgttag 
gtaagtattt 
gacggttctg 
ccacctgtag 
attggcagaa 
actttcccca 
ttgacctggt 
gtcactaagg 
aatgatataa 
gttaaacctc 
gaattattgg 
ttgcatcagt 
aatgcatgtt 
caccctcctc 
agattttgtt 
ttaattatta 
ataatgaatc 
gaggctggtg 
cccgtttcta 
gcaactcagg 
agctgacatt 
taataataat 
. tcagaattat 
gtgtttttgt 
gcttctgcag 
gtccttggta 
aataagatca 
ttattttcaa 
tacttttcat 
aatcagaaaa 
tattaactaa 
agtatataat 
gttaataata 
tctttcttct 
caatctgtgc 
cgcttggtct 
atgactaaat 
tttccactat 
cctttttttt 
ctcggctcac 
agtagctggg 
acgagatttc 
ctcggccttc 
ttttttatgt 
aagtgatttt 
atggaaatac 
tttgcctgca 
atgagccaat 
tcattggcaa 
atgagtagtt 
• aactaaatat 
aaagcaatac 
tgatagttat 
atggcgtaga 
aagctgtgat 
tactagggcc 
gtaaaaatat 
gattttctgg 
agccacaaca 
agggctgctg 



gaatcttgga 

ttaactattg 

gttatctctt 

gcttgggatc 

gtatgacagt 

gatatccact 

gaatgggaga' 

cttagacgtc 

gtggtcatac 

aaaaaagggt 

gctgaacatc 

ggctgttcag 

gtaaagttga 

gaggaaatct 

tcaaaaaaga 

gtaatcattg 

attaaattat 

aggtgcagtg 

gatctcctga 

ctaaaaatat 

aggctgagac 

gcaccactac 

aatttgttaa 

ttatcttatt 

ttgttttccc 

cagcgcagtt 

ccttctgcct 

aagcacaaaa 

ataaatttag 

ccagattcac 

ccaacattca 

tagtatcatt 

attaaataca 

ttaactaatt 

ggttccacga 

cagtccctca 

gttattgtgt 

tgaggttatg 

atgatagcat 

tttaagacgg 

tttaacctct 

attacaagca 

accacgttgg 

caaagtgctg 

tttgtttgtt 

tttttaaatt 

tctgtttttc 

gtgactactt 

gagaaaccac 

attttaaaaa 

tgaatttctg 

atttacaaca 

agccaagttc 

gtacttttaa 

catcaataca 

tctttattta 

tctgatactt 

atggcagctg 

agtgtctgtc 

aagagtgttt 

gtgttaattc 



tcaaagaatc 

ctgtgtaaca 

acagatttga 

tttcatgagg 

ggctgcagca 

cctcactggc 

tacaagaaag 

acaagtcacc 

ttcagagcaa 

gtattttcat 

agtcttctcc 

taagatgatg 

ttgaatgtaa 

tgtttagtga 

agcatgcaaa 

agaataataa 

tatttaatta 

gttcacgcct 

cgtcaggagt 

aaaaattagc 

aggagaatcg 

actccagtct 

tgaaaaatgt 

atttttggtg 

atatgacaag 

ctgtgagcca 

gctccccttg 

aaaaaagttt 

acttttggaa 

ttaaggataa 

ctatgagcca 

atttacttat 

gcaatgtagg 

tacaggcctt 

ttcacaatgc 

ctcttttctt 

atagaatgcc 

cagtttttca 

gtgatgtatg 

agtctcgctc 

acctcccagg 

tgcgccacca 

ccagaatggt 

ggattatagg 

tgtttgtttt 

tataacttgt 

atcatacact 

atgtagtgtt 

agacatcaca 

gttaagacaa 

gtcataagga 

ggaagaacta 

tatttgcagg 

aactggaaag 

tgatcagtca 

ttcaaaatca 

attggaagat 

tccaacaaag 

tgtgcacaag 

aaacaaacca 

tatatggatt 



cgggttccaa 
aattatccca 
tgggtcaagc 
ttacagtgat 
tcggcatccg 
agcttgactg 
agagggtgag 
atttatacac 
gaaaactagc 
accaccatgg 
acaaatgaaa 
catctgacat 
tgtcagtgat 
cttccaagtt 
tctttgactg 
tttaattatt 
tatataaata 
gtaatcctaa 
tcgagaccag 
caagcatggt 
cttgaaaccc 
gggcaacaaa 
taatttccaa 
gattgcctaa 
aggctggaga 
tggcccctct 
cagcactgat 
gctgaatatg 
aaactgtaaa 
cgtcttttat 
attggtatta 
ccacaataca 
atagtaatga 
attaaaatat 
atttatgtgt 
gtcttttgcg 
cctcagtttg 
aaactgcggc 
agatgtaaac 
tgtccccagg 
ttcaagcgat 
cccccagcta 
ctggatctct 
catgagccac 
gatttctatg 
aagtgtcttc 
tgcccactaa 
taatgtgatg 
tttctagctt 
tttggtttca 
tagtattctt 
gatatacttg 
tcgagatctt 
agatgggaaa 
agttttactt 
ccatcacttt 
aaagttaagg 
tcaataatcc 
tttgtcagca 
atctggatgt 
aacaccacag 



tccagccttt 
agacttagtg 
atttgggaat 
gatgccagtc 
aggtatctta 
gcctcacttt 
aaggaaacca 
attcagttta 
ctcccttgta 
tggccatatg 
ataacaatat 
attttgtaaa 
gttttgtcat 
aggcagacac 
ttaaagtata 
tatatataat 
tataattatt 
tcccagcact 
ectggccaac 
ggtgcatgcc 
aggaagcgga 
gcaagactcc 
agaatgatag 
atatacagac 
ctggcaggca 
gtttctgggt 
ccttcattcc 
ccctgtttta 
gatagtagaa 
aatcatggta 
gttacgctat 
atgatg.taat 
tagtactact 
tttcagtttt 
catgtcacca 
aacttgaaac 
ggttagtctg 
agatgtgatg 
acggatcatt 
ccagagttca 
tctcctgcct 
atttttgcat 
caacctcgtg 
catgctcagt 
ttgtctgtca 
tgggaagata 
ttttagcatc 
atttctgtag 
aattctgaaa 
tcaggagacc 
gatcttctta 
caaaattcaa 
ccaaagaaat 
ggtacaagtg 
cttgttttaa 
tttctttaaa 
agagaaaagg 
actttaggga 
tttgagagaa 
caaaaggtta 
ccgtttctta. 
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atctcataaa 


4 344 0 


agtttaaata 




tttgtttttc 


/I "5 C £T f\ 


ggcagcatga 


4 362 0 


aatcaccctc 


4 3 680 


aggcagatgg 


A ^ i A 

4 374 0 


ttttaatttc 


4 3800 


agtttttgta 


4 3860 


tatttattga 


4 3920 


aactaattat 


4 3980 


aacaaaatgt 


4 404 0 


gctagtttta 


4 4100 


accactaatg 


44160 


ctgttgtctg 


4 4220 


tcatggagaa 


4 4280 


atgttttcac 


4 4340 


atactgtgct 


4 4400 


gttttctttt 


4 44 60 


gtggcatgat 


44520 


cagccttcca 


4 4580 


ttttagtagg 


h 4 d4 0 


atccacccac 


44700 


ggatcattt t 


4 4/ bO 


gai.gx.ct t utl 


A A Q 1C\ 
*i 4 o 


t. utuxaggcn 


4 4 O OU 


a a uaaT-ggcx 


4 4 y 4 U 


ggatata taa 


4 0 UsJU 


ggtaaacgtc 


4 5060 


caaagaaaaa 


4 Ox^U 


atatggaagc 


4olbU 


agagaaaacc 


45240 


ggcagtattg 


45300 


gatgttcatc 


45360 


tgagaagaaa 


45420 


atagtcagac 


45480 


ggtgtataaa 


45540 


ccccaaaata 


45600 


atgactgagc 


45660 


tgtgccgcag 


45720 


aaatagaaaa 


45780 



47 



WO 01/40493 



PCT/IB00/O1507 



acttgaaaga 

atttattttt 

atgaaaagtc 

gaggctatgc 

cacccctcgg 

attagcctgt 

ctcatttgca 

ccccagtatg 

gtacatgacc 

actacagggg 

agagcaggag 

tcaaatcata 

ttaacaaatt 

ataaatatat 

cagtggcttt 

tcataatttt 

ctcaggcctt 

ctgaatatag 

agtgttatca 

tagataaata 

ctgggcactt 

tgtacaagtg 

aattgctgga. 

ttccttggct 

gggtcctgca 

agacagtaaa 

tattatataa 

atattaaaaa 

ggaagtaata 

ctgggataaa 

taaaaaatat 

actttgtaga 

tgaaggatat 

agagctacgt 

atcttttgaa 

cagaaaacta 

gctgccttta 

taacattcaa 

atggatatct 

ctaaatgtgc 

taaactttta 

ttccaattca 

caactagtat 

ccttcttgcc 

ggacatctca 

cattcactaa 

ctacatgttg 

gactgggcgc 

taggacgatg 

gggatttctg 

tagaaaaaaa 

ttatcaggga 

cctcattttg 

acatatccag 

cacatggtcg 

gtactcagaa 

atatcgagca 

gtaatcctta 

tcaatttaaa 

tatttttatt 

tacagaagtt 



agtgagtgag 
ttctttgaac 
tgtgaatcaa 
tggaccccgt 
catggggcaa 
gtctccttct 
agagcagatc 
ggcctctctc 
actgggaccc 
tgtgtgtgtg 
gctttcacaa 
ccaaaaggta 
gtttcctttt 
acaaaaccta 
aagttcattc 
aaacgaaaca 
ggtaaccatt 
tggacacatg 
agtttcatcc 
atatttcatt 
gacttgcttc 
tctgcttgag 
tcagacagaa 
ttcccttata 
aatattatta 
tcaatccttg 
atgagaaatc 
aaaaatccaa 
gaaatataaa 
aattaagcct 
tgtatgacta 
aattctttgt 
cagaaagaaa 
tagcagtgag 
gcttatacag 
gcatgagcta 
gggaaagctt 
ttgttttttc 
tccccttgct 
tggtgacaga 
tttcaaaaca 
taggtcattt 
ccatccttat 
ccgtaaaaaa 
cttctctttg 
ctaagtagtc 
agtaacttgc 
taacatgttt 
ttttgggcat 
aaaggtttaa 
aggaatcaaa 
acttcttaag 
gaaag.tgttc 
tggtctttct 
gagatggctg 
ggcctcccca 
gccaccataa 
acctctgaga 
cctttaagct 
ggtggatcca 
cacattttag 



ctcagagacc 
tggtgaagtt 
tcacaggacc 
actgtagagg 
gaagtccaag 
gcacaatgct 
tcttgcttgg 
cagcaagggg 
tccaaggtgt 
tgtgtgtgtg 
tattccagat 
taaggagtta 
ttatataaag 
cataaaattt 
atgatgttgt 
tttgttccta 
attcttcctt 
cagtgttttc 
atgttgttgc 
gcatgtgtgt 
catgttttaa 
tccctgtttt 
actgtatgtt. 
ctatttgcac 
gggacaggac 
taaatgttag 
tgggtttcca 
atcaagctgt 
aagaaaactt 
ttaaaatttt 
gcttcttttg 
cttaaatcat 
tgtgaagaaa 
acccaattta 
tgttctgatc 
aatgtcttta 
ttaatcagaa 
acactgctct 
tagcacggtt 
cagagaggga 
atggagagca 
ttttgaaatt 
ttctagaggt 
tattttgtaa 
ttaaaaattc 
ggaaactttt 
tcacggttac 
ttcccaagaa 
atgtttttcc 
attaaatagg 
tgotaattca 
gtcagagttt 
aggactttta 
taaagattat 
aggcttaaga 
cctctcttgc 
tatgtgtgtg 
ttctcactgg 
gttttgcttc 
tatgacaaag 
ctacccgata 



tttttttttt 
ctctctactc 
cagttctgcc 
atccccactc 
aacgccataa 
ttctgcggct 
attcatgttc 
atgaggatgt 
ggcaggctag 
tgtttgtgtg 
ggaagacagt 
agagaagaga 
taattttagg 
accattttca 
acaaccatcg 
ttaaacgcta 
ctgctttcct 
cctttatgtc 
gtgtatcagt 
accgcatatt 
ctacagtgaa 
caattttttg 
taacattttg 
atgtagcatg 
gtgacagcaa 
tctttcactt 
tcagtgttca 
tttgtggtat 
gatcttctga 
tgaggttcat 
gaaggtaaaa 
atagcttaaa 
gaagaaaagc 
aagagtttgg 
cctaacaacc 
tgtttgaatt 
tgtctcttca 
tgaaagagcc 
tatgtcaaga 
tgatcattca 
catcccatca 
gtattttctg 
atcaccgtgc 
gtattcgtat 
cttatgttta 
gagacagctt 
atgatagtta 
aaataaacct 
aagcatctgg 
atttcccttt 
gaaggtattg 
cactatcaca 
ggtgcagttc 
aaggacttag 
atgctgcaca 
acctgtgatc 
tgtgattaat 
atattaagga 
tgatactatt 
tgaaaaggag 
actgtaaaaa 



gtaggaagtc 
agtgcgactg 
ctgagacctc 
ctaccctctt 
gccaagggaa 
cttccttgca 
ttgtggtcac 
tgtttgatcg 
gcaggaatag 
tgtgtgtagg 
gaaatccgcc 
ataggatata 
tggtaaaata 
tcatttttca 
gtactatata 
acttcccaat 
gcatttgcct 
tagcttcttt 
acttcattca 
gcttatccat 
tatgctgcta 
gggtatataa 
aagaactgac 
cagatctcca 
ccttatggat 
tttttcactt 
aacttggaaa 
ttcagatctg 
ggttttcttt 
atatcaatgc 
aaactcagtt 
aaatgagcct 
acaaattaaa 
gtccagccag 
catattcttg 
agagaacagg 
tcctggcttt 
tgcggctcta 
ttttgccctt 
aaagtagtct 
ctgctacgaa 
attaaaaaca 
atttcgcacc 
gtacccacag 
cagcttttca 
tgcagaacaa 
tgtaagatgc 
ccattgcatc 
tagggacaga 
aaaattttga 
agaacaggaa 
ttctagcttc 
tggacacctc 
cagcaaatta 
gcatcctgag 
ctcatcttgc 
caattattac 
tcttaaaaat 
ccatgtctta 
gagggttttg 
caagggcaat 



tagatgagac 
gtgaggttgc 
aaggagagag 
gtggacaatg 
cgtacacaca 
gagtgtggag 
accacaccat 

cagagaattc 
taaggaatat 
aagcctaaat 
ccttacttgt 
tacagaaaat 
gtgtgaaatt 
cagaacattt 
cccctgtacc 
attctagatc 
cctgtagcac 
cat aatttta 
tcattagttg 
tgaacattgt 
ttaagagtgg 
aagttatcgt 
ctgtagccct 
ctatacatgt 
ttgctttcca 
cttttgtgtc 
actgaaacca 
accaaagtaa 
cactgatttc 
ctcctacaag 
tttaaaactg 
agaacgaaca 
agcttttcaa 
aaaaattaac 
gaaaggtctt 
tgacctttcc 
agaaagaagg 
gtaaaagcac 
ctgcttttgt 
aagttctaac 
catatttctc 
ccctaaataa 
tccataatta 
aagggctttg 
gacacaacaa 
ttggtgatag 
tagtccacac 
tcaggtctaa 
tagagaaatt 
aaatattagc 
ttcagtgcct 
acatgtctga 
gtctgttttt 
gtgattatta 
ccttttccaa 
taaaggatca 
aggagaataa 
gcattcaatg 
gaaaaagaga 
tctgtattgg 



45840 
45900 
45960 
46020 
46080 
46140 
4 6200 
46260 
46320 
4 63B0 
46440 
46500 
46560 
46620 
46680 
46740 
46800 
46860 
46920 
46980 
47040 
47100 
47160 
47220 
47280 
47340 
47400 
47460 
47520 
47580 
47640 
47700 
47760 
47820 
47880 
47940 
48000 
48060 
48120 
48180 
48240 
48300 
48360 
48420 
48480 
48540 
48600 
48660 
48720 
48780 
48840 
48900 
48960 
49020 
49080 
49140 
49200 
49260 
49320 
49380 
49440 
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gcttacattt atcttttcag tggatctatt taatagaaat agaaataaaa cacatatcaa 4 9500 

taattgcacc aagatactga aggtagtttt tattgaaaac ttgatcacta aggaattgac 4 9560 

agaattgaga agtgagaaaa tttcaaagat acacatgcct gaaagcagaa atcatatctt 49620 

atatgtcaaa catgaaaatc aaagtcagct gcttgaaatg tgggcctaat aaatcttgaa 4 9680 

tctctgcaac tgtagggaat agttattgtt ttctaatcta gcacacattt tagtaaactt 4 9740 

ttctgtcctt aatgttaaac ttcatatttc ctacacatct tttttaatgt taattatttt 4 9800 

aatatgcatg catatattta ccacttatta catgggaaga aacaattctt aagtctgaga 49860 

aggttggtaa tcatttacat ttctgacaca aaactgaagg catggttgag aattagaata 4 9920 

attaaaatag cactggggga caacctgaaa actgtgactc cattctgata gaaggagtca 49980 

ttgtccttga acaatttgca aatatgttcc cttcttttcc caaatatgtt cccttctttt 50040 

cccataacag gcttccatgc tttatgcaac taatgcgtta tgaattgcag ctctgtttct 50100 

tacagagaag tgaatttaca aaagacatac ctttcacatc atgttcacct ttttcacctc 50160 

aatgacaccc tgctcccttc tcttaccttc cctgacacca cacagttcac catcagctct 50220 

tcctccttct caacacccta acttctaacc tgcaactgaa gctcacagtt gtgcttagta 50280 

tctgcgtgac agttcccttc acttgacaag ttttaaaagt caactttctt tgtcttacag 50340 

tttattgtct tcagtctttt atttccctta ccttttaata aaaggattca gtattccatt 504 00 

tctattaatc ttttattaaa gtttattaga aatagaatgc tgaatagaaa ataggaagtt 504 60 

tagctgtagt taagattgtg attattcatt atactgatat ttttcttata ttgtttttga 50520 

actgatttat gggtagttag ctccaaaact gaaggaatac tatgatattt tcaacactat 50580 

taccagcact atgtcaggta ttaagtttaa tccatgagca gagtgcaatt. tcaagatcat 50640 

gaattttcac aaacttcaat gttactactg tgtttatgct gaataaatgg tttctttggc 50700 

acctctttca cacacacaca cgcacacatc tatacatggt aacataggct aaagcaatta 50760 

gagaagttca .ggtatttcaa .tgactgaatc acaaaaatat gttattgatc tcatttgtta 5QB20 

atcaatgaat ggttaataag aagattaaga agacaaatta caaggtcaca gcccattcag 50880 

ctgaagcatg ttgttatggc cagaggctca tgagaggaaa aggtgtggct aagaacagaa 50940 

gatccgttaa gctccaagcc tgttcaatta tttaagtcac tgagacaaag tagatgtgga 51000 

gtgtgagacg ccagtgtaga gccttcaact gattttcctt ggggaggttg acctgactta 51060 

atagtgtggg aaatcaggtg aattttggct gtgactagat ttgaccaaga tggagttttc 51120 

catgaaaatg caaaccaatc tttagaattt gaattccata aagcaaaaaa ttcttacata 51180 

aatgcttata attttaccac tctaccactg tgggatgcat gccacgagat tttgtgagta 51240 

gttctaaaaa gtatgtcttc ttggacaaca tccaaagtag accacatgca cactgaaaca 51300 

agccttacac .aatagctttt ctatatatgt atctatatct acatctatat ttatatctgt 51360 

atctagctaa aagaaaagct ccacatactg atagaattat ataaaagcaa atactaacgt 51420 

ttgggaattt cgattataat acacttaatt gaaaacaaaa caaaaatatt ttttaaagta 51480 

ttaagcaagc atagagagac attttggtgc ttacaaggat ttctgggaat acaagcaatt 51540 

attttttgtc agggaccata tatttaggta cagatcatta acaatgcttt ctttaatatt 51600 

tatttagtga attgtacatg ttaactctaa gggcaattaa aacccataat tataatatga 51660 

gaaacttttt ctccacttag tattttttat tataaatatg gctagcattg aatcattggc 51720 

tataaccatg taattttaat gcttgggaaa catctgttag aaaaactttc cctatagaag 51780 

acatttatca atgtcctgcc ttatctctgg ttatctttgg tttattacat tttatgataa 51840 

ccataaaact gaaaccatcc tcatttacat ttttcctctt ctaaatttgg ggaaattata 51900 

ttttaagtgt ttatcattat ctcattatat aatctacaga aataataaac agcagcatac 51960 

ctgattttaa aatgagactt ttgaatttct acaataaata gaatataacg aactctgctt 52020 

ctttgggtat atcacgtcaa atatacatta atgagcagat agctttaaca tgtttatcat 52080 

gttataatct gattgttgtc tttactaatc aagataccta atttaactga agtatcaagt 52140 

gatctcaaaa cctcagtagc atctgataaa ggttgctgtc tcacctgtgc cacatggcca 52200 

ttgccatttg actgctgctt cacccacatc acctgcactc ccaaactcag cctggctgga 52260 

tatcttttta caagatacct gataatgaag gagagaagga acatggtgaa cactatgcag 52320 

tctcagaaga tctgcttgga agagacatgt tacttctgct gtcatttcct tggccagagc 5238 0 

aagtgaaatg ctgaacctga tcagtggctt attgtttgaa cagtaataca acctactgtt 524 4 0 

ctatcataga atgatattgg agtcatatat ttcaattagc cattatcaat gtctctcata 52500 

attttcttta cacattttct cagaatctag agtaaaattt aaaaaaggaa agacagataa 52560 

atgctctcat ctccaaattc agtgtcctaa attccactta ctttaatcag tataaaaatc 52620 

aggtagctca gattttaatg gactagtcac ttttgtctct ttttttttgc tatcagcgtt 52680 

tatattgatt 'atttggaaga aaaaagagcc ccctcaacct tccagagcgg acacacacac 527 40 

gtacacacac acatcggcaa cattagagag gttgacctga aagtgtttta aaaactatga 52800 

cacagttaaa ctatgatttc aaagtttttc caagcaagtt atatgtcaga atatgagtag 52860 

gggaaaacta ttgaactgcc acaattctgc tggtggtttc attggcaatt ccaattatgt 52920 

tctgatcata gtgtcatcag agcaattagt tcatcccaaa cttgttttgt attgccactc 52980 

cacagggaaa atagtaaatc tgtcctactg actgcagttc tctaaaatgt ccttaaagtt 5304 0 

tagttatttc aaacttagtg gaagaaatat gttgttggta gtggtcatcc ttgctagtac 53100 
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tctttaatca 

agaattattc 

ttttttttgt 

tgtatttttt 

ataacatcat 

tttttagact 

atgtatgata 

gatcattatt 

cagtagtgaa 

aattacctaa 

ctctccttcc 

cccagaacac 

attacatgca • 

ctttgaaggt 

tcagagacaa 

caaacaaaaa 

ggcaaaatac 

taaatacata 

ctttcaaaac 

ggctgctggc 

tagaagtgta 

aaaggtttgt 

attccattgt 

ggccttacat 

ggttcaactt 

tagtgtcttt 

agtattttat 

ttattcactt 

cataagagtt 

cctttattcc 

gtactacgtt 

gaaaaacttt 

tgttttaatt 

tgattacatg 

cactgtttaa 

ttttcgtatg 

aattccttta 

taatgttcat 

ttgggtatca 

atattttgga 

gcaaaaaagc 

tctcctgact 

ggctttgcgt 

attactttcg 

ctatgttcct 

taaaggtttt 

cctattattt 

gctagtttcg 

gatgatttaa 

ttcattatgt 

tttctaattt 

ccacaaattt 

ttgatcagaa 

tgacctaata 

ccatagaaac 

cattcaagtt 

gtggggtatg 

cactttttgc 

atatcttctt 

aatctacttt 

aaaatatatt 



ttacctgcca 

tactaccgaa 

aattctaact 

ccatttttta 

ttccagagtc 

tcctaaaatg^ 

cggtttcctc 

ttttaaagaa 

ttcagaatgc 

tttgtaattg 

ctccccggag 

ttagcaattt 

ataacgatac 

ttagatacga 

gtcagagaga 

acagatttta 

acatgacatt 

tattacattg 

tgaaattcta 

aaacagttct 

atcatacaga 

ccatgttgta 

atatataaat 

ttgggtcttt 

cattccgttg 

ttcccattaa 

tctattggcc 

attagttctg 

gtctacatat 

aacatatgtt 

aaatagaagt 

ttaaaaaatt 

atgaaaggtt 

tcttttttct 

tttttgtgtg 

ttgaaccatc 

atatgttgca 

acggaatatt 

aggtagttct 

aattattgaa 

catcaggcca 

cttaaaatgt 

ttctagaaat 

taatactctc 

tttttttttt 

ctaaatttgt 

tacattctct 

ggttcagtat 

gatctttctt 

cccatatgtt 

ctcatgtaat 

gtaaaagttc 

aaatgttttg 

tatgttctat 

tgggtagagt 

ctctgttgcc 

ataactacaa 

ttcatatatt 

gctacattaa 

gtctgatatt 

ttttcatcct 



ggaaacactt 
agctcaagat 
tttcaggtac 
tggaattgta 
atcaattcct 
tcttgataac 
tatataatgt 
actcaggaac 
atagcagagt 
cagctattca 
ctcctccttc 
ctgctccgat 
attcaatgtg 
ttgactgtgt 
gagagagaga 
agaactgagc 
tactatctta 
tgtagacatc 
cccattaaac 
acttattgtc 
atttaccttt 
gtctatgtca 
cacattatat 
gattcatatt 
catgtggaga 
atgatcttgc 
aacatgtttg 
aattcattta 
atgattatat 
ctctcttttt 
gaggaaagca 
gtgatgaggt 
attgaatttt 
tcattagtgt 
ttgaaccatc 
tttgtattcc 
aaattcagtt 
gatctttagt 
ggcctcatag 
aattggggtg 
aggacttttg 
ctattacaat 
ttgtccactt 
tgctgatctt 
tttattttgg 
tactattttg 
attttatctc 
attcttattt 
gttttttgta 
ttgacatatt 
ttattatttg 
tcgtttttct 
tatgatattt 
ccaggaaaat 
gttccgtatc 
ttacttatat 
caattattat 
ttatggtctg 
accttttaat 
agtatagcca 
ttaactttca 



tttatttacc 
gtatttgtgc 
ttttctcttt 
aggcatattt 
ttgtcaggac 
caaaacacag 
gtcacataga 
aagaaataag 
gtgcaaagta 
ttgcacttga 
ccagcgtttg 
ttgctctagc 
aaactacaat 
caaccataag 
gattgacaaa 
cctaattcat 
accattttta 
accatcctcc 
aattcctcat 
tgtatgattt 
ttgtgatggg 
gacttggctt 
atgttatatt 
cagttaattt 
ttcagttttc 
cacccttgtc 
ttttacgcca 
ttagtttcat 
catctatgga 
cttgcctaat 
gacacccttg 
agtttccttc 
ctcaagtgct 
ggtgtattcc 
tttgtattcc 
aggcataaat 
tgctagtact 
tttctatttt 
aatgtgttag 
agcccttctt 
tttgtcagga 
tttctatttg 
catccaggtt 
tgtatttatg 
taacttgagt 
aagaaccaac 
agctgtaata 
ttttagttcc 
atgtatgcat 
gtttgttcat 
atccattggt 
tctgttactg 
tttaccgtaa 
gttccatgtg 
tgtccattag 
tctgtctcat 
agaactgtat 
ttattagatg 
aatatataat 
ctcttgctct 
gtgtatctgt 



agtgcatgag 

tatagttata 

agctgtttta 

ggtggttcta 

atgacattat 

agggttatca 

gtattttatt 

gatgcactca 

aagcaaatgc 

ttctttcctt 

tatggcagct 

ttcaaaggtc 

ctgtgtaact 

cagaataata 

caaacaaaca 

ttttttttct 

aatgaacagt 

atcttcaggg 

tccccccctt 

tgatcactgt 

cttatttcac 

cccttttaag 

ctaagcattt 

ttatatggag 

tcagcatctg 

aaaaataatt 

gctgatataa 

atgggatttt 

cagagataag 

tgctctagct 

ctttgttcct 

tatcctagta 

tttactgcat 

attgttaatg 

aggcataaat 

cctacttaga 

tttttgaaga 

ttgtgatgtc 

gaagtgttcc 

taaatgtttg 

tatttttgat 

tttgtggttt 

attcaatttg 

tagacttgga 

ctttcttctt 

ttttggtttc 

tttattattt 

ttaagttgta 

ctaaattttc 

tttcgtttat 

aagagtatgt 

atttcttact 

gttttttgag 

tacttaaaat 

acctagtcgg 

tgttctattc 

gtttctgttt 

cataaatatt 

gtccttcttg 

tcttcactta 

gtctttgggt 



agtttaggtg 

aatatagatt 

ttttagtata 

gtttacaact 

gtttttccat 

ttatacagtt 

tccagccaaa 

cacaaataga 

atccatgccc 

caactccttc 

tcgtaataaa 

tatggtcatg 

ttgtatttct 

attcctgtgc 

aacaaacaaa 

tttttattgt 

ttagtgatat 

ttctttccac 

cccccagctc 

aagtacctca 

ttagcatcct 

gctgaatgat 

tatattttta 

gttcatataa 

ttgttaaaaa 

tggccatgta 

ccatatgtaa 

ttggtggaat 

tttacttctt 

aggatttcca 

gatctgacag 

tactgagtat 

caataaaaga 

tggtatattc 

cctactttga 

aatggtgtat 

ttgttcccat 

tttgtctgac 

ctcttctttg 

gtagaattca 

tagtaatcta 

agatttagta 

ttggcatatc 

agtaatgttc 

tctttctagc 

atttattttc 

catttttttt 

aagttaggtt 

taacagtgct 

ctccaagtat 

tgtttaattt 

ttatcccatt 

acttaatttg 

aatgtgtatg 

tgtattgtgc 

attattgaga 

tcaattctgt 

tataattttt 

tatgttttaa 

ctatttggac 

ctaaagtgag 



53160 

53220 

53280 

53340 

53400 

534 60 

53520 

53580 

53640 

53700 

53760 

53820 

53880 

53940 

54000 

54060 

54120 

54180 

54240 

54300 

54360 

54420 

54480 

54540 

54600 

54660 

54720 

54780 

54840 

54900 • 

54960 

55020 

55080. 

55140 

55200 . 

55260 

55320 

55380 

55-440 

5550O 

55560 

55620 ■ 

55680 

55740 

5580O 

55860 

55920 

55980 

56040 

5610O 

56160 

56220 

56280 

56340 

56400 

56460 

56520 

56580 

56640 

56700 

56760 
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tctattgtag 

gattggagag 

atttgttttt 

gtgtttagtt 

ttctttcttc 

aacattctaa 

agctctgtcc 

gttcgaaaac 

aaaatgtgga 

tgttttagtg 

tactagcttt 

attgtctagg 

ggtctagagg 

cattttagat 

tattttgtat 

aactctgata 

tttattgttg 

tcaggttttt 

acccatatat 

aaaaagagaa 

ccacctcagc 

gtttctttgt 

tgttatctgg 

ctgctccagg 

gtgctaatag 

gagtttgaag 

tagtaacatc 

ctagtgctac 

catttctggg 

cctgctttaa 

atgaaaacat 

aaaaattgag 

tggactgtct 

ttatggggtg 

gtgaggtgac 

aaggatgtca 

atgagttgca 

ggtagttttc 

gtgagggaag 

gctgttctta 

attattattt 

ccaaatcttt 

ggcaattgtg 

tgtggggaag 

aaaatttaga 

gtacctcagg 

agacttgggt 

cttcagcagg 

aatccaagct 

tctatttgaa 

tgtagctacc 

gaaactgact 

tctttcttat 

tgaggaagtc 

ttggtgctca 

caaaatttgg 

caaacctggc 

aattttattc 

tgcttgttcc 

tgcgtcggac 

tgtgtgactg 



actgggccta 

cttaatccat 

tatatgcctt 

catctagttg 

tatattcttt 

tttgaatttg 

ccacctcttt " 

ataaactaat 

attgcaaacc 

tcttaaatga 

taaaattgcc 

gtcctttaat 

taataaactc 

aacagttttg 

gccttctgat 

accggattct 

cagctgtctc 

ttctgaactt 

acagttgctt 

aaacaaaggg 

caggggagag 

ctgcacctct 

caggcagggt 

caccgctgcc 

ctaaaattga 

tgcaagtctt 

agacagattc 

ctactccacc 

gttgtaagtc 

gttttcaatc 

ttggctctaa 

agtaaaagaa 

gaaacagttt 

tcattaattt 

attcataaat 

tgtagatcat 

gccattttaa 

tgatagtaat 

actgttttta 

ttccatagca 

tcagaagtct 

ataatgctgt 

cagtgggatg 

agtaaccagt 

tgttatagta 

tgatactgtg 

attgacaaga 

ggaaatacag 

atataaagag 

ttcaggttct 

agccttgaag 

atggtgaatt 

tcatagtcta 

cgctgtggtc 

atgtcatgag 

cgaacccttg 

ttcttcactt 

atagatgtaa 

tgctttgatg 

aagtatgaga 

tagaccgtac 



catcattttt 

ttacacttaa 

atagctttct 

atttttcagt 

gttgttacca 

tgccaactta 

cagctgttca 

aattctttga 

aaaattacaa 

agtagaaaac 

tatttttaat 

ttcatcccaa 

ccccagcttt 

ctgaatctag 

tttcacttga 

ctttctttct 

tgtgccaagt 

gtatcttccc 

gtgaatgtct 

agaaaaaatg 

gggctgacaa 

gggatcagaa 

tgtttgttgc 

tgccacagag 

ccacaattaa 

tccctggaac 

tgtcagggca 

atctttctag 

aacaatctgt 

cagttagccc 

gtgtggttgc 

cgtgctatta 

tttgagagac 

gtttttgtcc 

ctgagtattc 

ataaaataac 

tattttgtcc 

ttccaattcc 

tatcattcaa 

tcttatgtag 

tggtttttcc 

tgccaacttt 

gatatgtgaa 

ctggattgaa 

gttcttatat 

atagctataa 

acttggaaaa 

ttatgttcct 

acataataac 

aagcagtatt 

ttacctgaac 

cttaccccaa 

tgattggtgc 

ctaaagctgc 

tactaagatg 

tcttttgggg 

ctgtgtacga 

aattttggca 

cgctacttat 

agacatgctt 

ctccatgaac 



taaaatccat 

aatcattgct 

catacctcat 

taaatattaa 

tggtgattat 

attttaataa 

tgctataaaa 

tgcattcatc 

taccaacttt 

aaaatgaagt 

tcagatttaa 

ataactcccc 

tgtttacctg 

tattgttggt 

aaactgagca 

gagggtttgc 

attagcatga 

actgtgtcta 

ttatcttcaa 

aggcaecacc 

taattgagga 

gcagtgatca 

ccaccctagt 

ttgagaggtg 

ccacaactga 

ttaatttcaa 

ttattttttt 

aatattctcc 

attttaacaa 

tctctatcct 

catttaattt 

ataattcatc 

acttagaaaa 

aatttttgca 

atatctaaaa 

tatagaaata 

atttttatat 

tctcccaaaa 

attttatcag 

gattgatctc 

attggcaaat 

gtgtctttaa 

tctgtgtgaa 

ttgaatgcta 

aagaacatta 

ttgtttatga 

ggatcaagca 

gggtcactgg 

tatgatcaga 

ccccaaagag 

tttggtattc 

atgtattttt 

atcttctcta 

acctgtgtgg 

attctgagta 

aagaggggca 

gataatttca 

ttttggctaa 

gttgtgtttg 

acagtcacat 

attacaaatt 



tttactaacc 
gataagcaag 
tttctacatt 
aagtcttttt 
attcaacatg 
tacactcaaa 
ttatatctta 
tctcaaatta 
tagcataata 
tatacagcgt 
tttttgtata 
tgagcatttc 
gaatgcctta 
tgacagtatt 
tttgaatcta 
tgttttgttc 
ggtgtaaact 
catggtgact 
tgtcagccgt 
ctctttacat 
ggagcataac 
gtgatcagag 
ttctgcaagc 
gggaatgaat 
cagtctaagc 
tagtttccag 
ctatgtaggg 
tccccaataa 
tttctctagg 
caatataaaa 
atgttccaaa 
tgtaataata 
aattatgctt 
acaaaatttt 
acccaaaagc 
ttgaatccat 
attctatgca 
tggactggac 
tccttttctg 
ttccttgtgt 
gttcaaataa 
ataaagtcca 
aaatgatatt 
aattggcttt 
aacatctcac 
cggaaatatt 
gcaacatata 
actcttgtta 
actttgtgaa 
atctgatatt 
tattattccc 
ccctagtaga 
gctccctttt 
ctgttgtcta 
agagtcctaa 
gaattcttca 
aactgctttt 
tttaatttca 
aaacatagta 
aaaaggagaa 
tctgactgcc 



tctgtcttct 
caattttgct 
tctgtgtttt 
ttctgctatt 
ctaaagttat 
tattttttac 
tacatttcat 
tgtagaaaac 
attatcttaa 
gttacaataa 
gcttttagtt 
ttgcagggaa 
atttctccct 
ttacacttag 
ataatatgtt 
ttgacttttt 
tattctctcc 
tcctaatttc 
ccaaaagagt 
tccctgggaa 
aatggccact 
cacgggtctc 
tgtgtggtag 
aggtgctatt 
catctcctgg 
aattccaaaa 
atacatattt 
tttatatcat 
tgtctctttg 
caaaagcacc 
ccagaaaaaa 
tatatgaacc 
tggaaatttt 
gacttctata 
atttttttct 
aatatgtcat 
tataatatat 
agatttctta 
tgggtcccct 
tagagtcaat 
aactttataa 
attgccatag 
ttaaatgctt 
acatttggaa 
tctaaatgaa 
cactcttaga 
tgaaactgtg 
tggactgaac 
atatgaccgt 
gaatcaatgt 
ttagcttgta 
tgaactataa 
ttagatgtgc 
atgtcccgtt 
ttaatgccac 
cgatatgcct 
ctcactttta 
aatttgtatt 
. gaaaagaggc 
taaaaatatg 
atctagaaca 



56820 
56880 
56940 
57000 
57060 
57120 
57180 
57240 
57300 
57360 
57420 
57480 
57540 
57600 
57660 
57720 
57780 
57840 

57 900 
57960 

58 020 
58080 
58140 
58200 
58260 
58320 
58380 
58440 
58500 
58560 
58620 
58 680 
58740 
58800 
58860 
58920 
58980 
59040 
59100 
59160 
59220 
59280 
59340 
59400 
59460 
59520 
59580 
59640 
59700 
59760 
59820 
59880 
59940 
60000 
60060 
60120 
60180 
60240 
60300 
60360 
60420 
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cgtgctttat 
acatgtttgg 
aactacacaa 
ccctgatgac 
gctaatgcaa 
tcctaatttg 
agtggagctt 
aaaaataaat 
ggttatataa 
tcaagtaatc 
tgagaagaga 
atgaagaaat 
atcatgtatt 
cagctataac 
tgtttttttg 
gttactaaaa 
tttctgtatc 
ggatgtaaag 
ttgagtctat 
cttttgatat 
ggaaggaagg 
ggaaggaagg 
gggagggaga 
gatggaagga 
gaaggaagga 
agggaggaag 
aatatttttt 
tgagcttgac 
tggctgatag 
ctaaaatgaa 
atgtgcagag 
agatgtcaaa 
acattgacat 
tgtccacatt 
aaaagaaaag 
tttaatctct: 
accttgtatt 
gacattctat 
tatattgtgt 
actaaaagga 
agtgactatt 
aattgcaacg 
aggtataagc 
aaatgaactg 
■ggccaagcta 
aaatcatttc 
agctcagcta 
cttctttgaa 
tgtttatgaa 
ctttttggat 
tatatctaga 
agtatagtga 
tggggtgtgc 
atttaaaata 
aatactgatc 
tcactgcatt 
ttaccaatgc 
ctacttaagg 
gttcctgttg 
tgtaactggg 
agtctgcatc 



caacatgata 

tttagatgtg 

aaatcaccct 

agcatgtttt 

aattatcttt 

ccccttagca 

agcatttttf 

gagcatagct 

agtgaaagga 

caaaggtatc 

aggtcctgat 

attgtaataa 

ctgtacacaa 

tttagaaagg 

gtggaaagag 

ctgaaaatta 

agtgttagac 

gagtaagcaa 

ttatagtaga 

gcatacattt 

aaggaaggaa 

aaggaaggaa 

aagggaagaa 

aggaaggaag 

aggaaggaag 

ggagggaggg 

aaaaagaaac 

ctggatctgt 

tcttaaaaga 

tgttgaaatt 

atatgggaat 

gctctttctg 

acgtgcatta 

ctcagttttt 

ctaactgtga 

atgactttca 

ttaattagga 

gggtgtgcgt 

aagagcagat 

aagcgtacca 

aaaagagaat 

tttgcaaatg 

aagtattttt 

actggccttc 

tgaattatta 

aaatagtgaa 

agttttttaa 

ataatatctt 

ataagaataa 

atgtacttca 

tgtagacaca 

aactgccaga 

caaatcccca 

ttaagatcat 

tcggacttct 

catgcgtgtg 

cacttttata 

gagtgcaggc 

gtcctgtgaa 

ttgattgtgt 

atagcaaata 



tcaatgatgt 

acctttgacc 

aaaagtaatt 

tgtagttaga 

actagctgga 

ctcacaaatg 

"attctgatag 

tcaaacattt 

atttgggtaa 

aaatgttbaa 

gagtggctaa 

aatttttgaa 

atgtattctt 

ctgattgtta 

agatttaaaa 

gggaaatgta 

cttttatggg 

agacactaag 

atcactagaa 

tccctgaaag 

ggaaggaagg 

agaaggaagg 

ggaagaaaga 

gaatgaagga 

gaaagatgga 

aagaaggagg 

tctttgatga 

aactctt'gac 

cagaatccaa 

ataaaataaa 

acatatgtgt 

catgcatcct 

aactaatatg 

tcatcataac 

aactcaaaga 

cattcttagc 

tctttcagtg 

gaaagagagg 

gacaaccttg 

gtttggatta 

atataggaga 

ggcttgtcac 

catttttaag 

tacaagtatt 

aaagaatatt 

atagtattat 

tcaatgttta 

tgaaaccaat 

acagcctgta 

acttaaacta 

tcccccaaaa 

ccacaagcga 

caccttgttg 

cctctttagg 

tgtgagtttg 

cttgaagctt 

agattttgcc 

aagatgacct 

ttcctaatgt 

tgcatgacat 

cgttacggga 



ggatatttta 

atttctgtat 

ttccatgatg 

ccttgggaaa 

tctccttgga 

tcatctcccc 

aagagactgt 

ctttaataga 

ttaatatgaa 

tgacataaaa 

gtagtgatga 

agatactcag 

tccattttat 

ttctttggct 

taaaatggaa 

gaaaatataa 

tgaatgaggt 

accagagcat 

agtggtgacc 

aaaaaaggaa 

aaagatggaa 

aaagattgag 

aggaaggaag 

aggaaggaag 

aggaaggaag 

gagggaggaa 

ttcaacatca 

atttagctct 

cataataagc 

atacaatata 

cttatgtgtg 

tgcttagacg 

agaatgtatg 

tttttctttc 

aagtctttgt 

ttcagtaatc 

tttgtggcct 

accaaggttg 

agattcgatt 

gatacattaa 

aaaaggtatt 

ataccatcat 

gatcattgca 

tacttacaac 

gacattttgc 

actaattaaa 

tttcagttag 

gtccacattt 

attttaatat 

catatatagc 

cctatagaaa 

tgttttgtat 

tcaatattag 

atctacttgg 

atgtgtttta 

gctgccttgt 

tgcagtgtgt 

accctgagtc 

ctttataata 

taagtaaacc 

gagaggggca 



cttactttct 

aattttaata 

cattttaaag 

gacaacaagg 

ctggtcattt 

aaagccatag 

gtccatgcaa 

aacttagcct 

gtcaaacatt 

tacctctcta 

aagtctaagg 

agatatttct 

tttcagtgat 

tttctccctc 

gcaagtggta 

actaatatca 

taaatgtgtg 

gactttatat 

acacagagtt 

ggaaggaagg 

ggaaggaagg 

gtaaggaaga 

gaaggaaaga 

gaaggaaaga 

gagggagcaa 

ggagggaggg 

tggagaagtt 

tgaaaataca 

ctgcacatgt 

tgttcacaaa 

cttatttcta 

ttttgtctga 

cattaaacta 

cagagtgcat 

tagggcctga 

cttgaagaga 

tgaggagcga 

ttttctttgt 

tgtgatatga 

atcagtatgt 

agaagttgca 

gtaagtacta 

aagatagaga 

cacaattgta 

tatattttta 

ctttattaag 

gggatagttt 

ctttacagat 

caccaaaact 

tatagatata 

tgtatttctc 

ttatagatct 

tttaacaaaa 

agcgctgaaa 

catttgtaat 

aggagacaat 

caatgtgact 

atccaaagaa 

aatggccacc 

taactgccgc 

cgtcctccct 



attttgttat 

aaatagattt 

gtcatgtttt 

agatctgaga 

atcctctgaa 

ctgagcagaa 

ttatgacagt 

ttgggtttaa 

aaaagattcc 

gtccttatga 

ctaaatttct 

tgagatcatt 

tttccatgta 

aattaaatca 

ttaattcaga 

agctaattct 

ttagtataaa 

ctatatcagc 

gcattttttt 

aaggaaggaa 

aaggaatgaa 

aaggaaggaa 

aggaaggaaa 

aggaaggaat 

ggaaggaggg 

aggaaggaag 

attattatat 

tctgttttct 

atccccatat 

tattaaaaaa 

tgaagtcgtt 

aattctgttc 

gcttgaatga 

gatgccgcag 

gtattttttt 

acagagtcaa 

gagcaagaaa 

gactagagat 

aaccaaatgc 

catgtaaatc 

ttttttaacc 

gtgaagtcta 

atttgacaat 

atttttccat 

aattatcttt 

tacattttca 

tgtttttttt 

aaaagagttc 

attttaatct 

tagatatgtc 

aaatgtgtaa 

acttgcaatt 

ctaaaaggaa 

aaggtgtgaa 

gctattctct 

gcaacaactc 

cttcagtacg 

tatttctcta 

taaaaattaa 

aacagggtgc 

cagcgcccac 



60480 

60540 

60600 

60660 

60720 

60780 

60840 

60900 

60960 

61020 

61080 

61140 

61200 

61260 

61320 

61380 

61440 

61500 

61560 

61620 

61680 

61740 

61800 

61860 

61920 

61980 

62040 

62100 

62160 

62220 

62280 

62340 

62400 

624 60 

62520 

62580 

62640 

62700 

627 60 

62820 

62880 

62940 

63000 

63060 

63120 

63180 

63240 

63300 

63360 

63420 

63480 

63540 

63600 

63660 

63720 

63780 

63840 

63900 

63960 

64020 

64080 
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aacagggatc 
atctttctcc 
aataaaactg 
ttttaaaaat 
ttgagaggcc 
agtgaaaccc 
tgtagtccca 
gttgcaatga 
tctcaaaaaa 
tgcttttttt 
acagaatata 
aaatcaagaa 
gtctggaagc 
gtaccactca 
cccccacctt 
ggtttagtga 
gccaagctgc 
ctgagatgtg 
agaagacctg 
tggatgtcct 
acataggtta 
atagcaaatt 
cttagacatc 
ctttctgaga 
gttccttggc 
tccatcacgt 
tctattggac 
acacacagaa 
ggcattgatg 
catatagaaa 
tagatagaag 
agaaaatgcc 
acacacacac 
ggtagattta 
aacaatactg 
aggatgtgcg 
agtaatgaaa 
attcaagaga 
aaagttgctg 
gcatatatta 
tttgtacatg 
attttaaaaa 
aaaaataaca 
tttcaaaatg 
aaaattttgc 
tcttacagct 
taacgttcta 
tgagaggaat 
atgttccctt 
tgactgagac 
agccttagga 
gtcagatggt 
gaaaactttg 
tgaaatttcc 
tcatgatctc 
ttttcaatca 
aaagattttc 
gggtaatttg 
atagcaccga 
ctcacatggc 
tcccttaaga 



attaaccaag 
acacagtgct 
gcatggctgc 
gaaactattg 
aaggcaggca 
cgtctctagt 
gctacttggg 
gcagatatca 
aaaaaaaaaa 
taagtttatg 
aagagacagc 
gaaaaggcaa 
aggtactcac 
taacgtgact 
gcaaaataat 
ggtttactta 
ctgttggatt 
gctcaggatg 
tgtgactagt 
cttgttaaaa 
agtatcttcc 
gccacaaact 
caacatgggg 
ctctagagaa 
tcttggctgc 
tggctctgtc 
tcacctggat 
aatcacctct 
tgtctaccac 
taaatttgtg 
agtttccaga 
aaggaatatg 
acacacacac 
gagactccga 
aaaaaagtaa 
.tgtctcgtac 
ttgtagcacg 
tgtattttac 
agtagatttt 
attagcttga 
ataaatacat 
ttggaaggcc 
attaggcctc 
ctttggcaaa 
actcagataa 
cagttttcct 
ttatttgtgt 
gaagatactg 
gcatggcact 
tacatttaca 
acttccattg 
cgaaaaatta 
tggttcttta 
ttggaatcag 
aagttattgt 
ggagccaatt 
tttagtgtgt 
taaaaagaga 
catctgctcc 
aggagcagga 
actcactcac 



gctgacacaa 
ccaggaagcg 
cacatagatc 
tcggccaggc 
gatcacgagg 
aaaa.atacaa 
aggctgaggc 
tgccactcta 
agaaactatt 
ttttgaaatc 
ataaatgata 
tgataaaaag 
agtggcattc 
ggccagcaaa 
gctgaaataa 
cagaaaataa 
tgacatatcc 
ggagaccaca 
gagaaaagca 
taaatgtact 
aaaaaagaca 
tagtagctta 
cccactgggc 
aaatygtttg 
ctctccatct 
gctcacattt 
aatccaagat 
gttatgtaaa 
agcatccaca 
gattcaggaa 
ctagacatcc 
cgaaggaatt 
acacacacca 
catataacca 
ggaattttcc 
tgagaaaaat 
tcaaaatcca 
aatagggtaa 
aaatgctctt 
tgtagccatt 
agtcaaaata 
atattcttaa 
tctagaaacc 
aacaaaaaac 
agcattaatc 
atctggatta 
gtccaaacat 
atgacagaaa 
cttcagttga 
caaattaagg 
caaaaaagga 
attggcaaac 
ttaaacgaaa 
gaataagggc 
ttctcataca 
ctaatcacac 
taggttgttc 
ggtttaattg 
tggggagtct 
gcaagagtag 
tattgcaagg 



ttctccagtg 
attcgacatg 
ctattttctt 
gtggtgactc 
tcaggagatc 
aaaaattagc 
aggagaatgg 
ttccagcttg 
gtcaaccaat 
ctctctcaat 
gaaaacaaaa 
ggctgaagcc 
ttgggcactc 
atttactgct 
aaggtgttac 
aaaagcaaac 
tgcaatcaca 
gggcaggcag 
agagtaagag 
aaaacaaata 
cagcattcat 
acacaacatg 
tcaaatccag 
ctttggtttt 
tcaaagtgca 
ctacttctct 
aatcttcatc 
gtaacatatt 
aagggaacaa 
aaagtggagt 
aggaaagaat 
gcatgcatgt 
catagaaaag 
tgaggtttac 
acaattgaag 
aaatgaaaat 
tattctcttt 
ctacagttaa 
actataaaaa 
ctacagtata 
atttaagaat 
aaactaaaca 
taactcttaa 
caaataacag 
aagatgaaat 
actttagttt 
gcatatttat 
ttagagcagg 
caaaaggaca 
agttgtgtaa 
gtttaacatg 
aagttgattg 
acacttcgga 
ctttatatat 
tatcagagat 
atcagtggca 
ttgtgttgct 
gctcacagtt 
tgaaaaagaa 
ggagaagtgc 
acagcaccaa 



agaacagatg 
agtctgtact 
ttgcctctta 
acgcctgtaa 
gagaccatcc 
tgggcgtggt 
cgtgaacctg 
ggcgaaaaag 
gagtgtaaag 
tgcaggaaaa 
taagcatgtt 
ggaggcatcc 
tggggcaccg 
taacataaac 
tgatcattgt 
agtcatacac 
gggaaaaggt 
gaaggaggca 
aaaggaagaa 
gaacaatatg 
gtgagcttca 
cattcattac 
gtgttggcag 
tcagctctgg 
tcactgcagc 
cttgcaagga 
ttaaaatcct 
tccatatcct 
aaaatattgt 
catggaacaa 
tcacccagga 
acatgcgcgc 
aacaattcac 
aataaaaaga 
aaaaacatga 
aaatccgcac 
gggtttgaca 
taatatttat 
tgataagtat 
taagtatttc 
ttttaaatga 
tgagaaaagt 
aaacagacat 
ggaaagagct 
acaaaatgaa 
tgcaatagta 
ccagcaacta 
gaatcagaag 
tgtgatccca 
cattaaataa 
gagctgttgt 
cataagctgg 
tattgggaag 
tctactcact 
gtgctaaaat 
acttggagcc 
ataaagcaat 
cttcaggccg 
ggcaaagggt 
cacactttta 
ggggatgatg 



taaccttgga 
ggcataggaa 
ccaagagtta 
tcccagcact 
tggctaacac 
ggcgggcacc 
ggaggtggag 
tgagactgtg 
gtgttatttt 
aaaataagat 
catgagccag 
caggccctgg 
aggtctgtaa 
aatgtcttgg 
ctgggactaa 
ccaagattgg 
tggtactata 
aaagcaatct 
agagagaaga 
aaaagaccct 
tattgctact 
tttacagcag 
agctgctttt 
aggctgctgc 
ctgcgcttca 
cccttgggat 
taatttagtc 
ggaggaaggg 
ccaataaatg 
aaatatttaa 
tgcagcttag 
gcgcgcacac 
agataggagg 
ataaaggaga 
gccttcagat 
agaggcacaa 
ggaggaataa 
tatactcttg 
attaaataat 
aaaacaacgt 
taaaaaatac 
attaccaaca 
tacagaatac 
ctatgaatta 
gtacatggtt 
tttcttgcag 
attcaaggta 
agagaagtgc 
agatttctct 
ccaaagtcat 
taaagggagg 
aagtactgtt 
acagcaacag 
gtctacatgg 
ttgaaatagg 
ttgagt.ttgg 
atctgagact 
tacaggtagc 
cagcaggtgt 
aacaatcaga 
ttacaacact 
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catgagaaat 

ggattacatg 

-tccaagacaa 

actgaaatag 

ctgataaatg 

ttgccaatct 

gaattctgac 

tctggtggct 

gaagtaaagt 

gtttaaaaga 

ttgaataaat 

ttcatattta 

ctaagtttca 

tcgctttaaa 

tatcagttga 

ttaggcaata 

gaatccaata 

ttcaacaatc 

ttctaactcc 

gtgcaagcta 

agagaattgt 

taacattatc 

gcaagactga 

tccaccatgt 

gaatcctttg 

gccaaaattt 

aatcactaaa 

agattgaaca 

cacaaatgat 

tttttaaagc 

fcatttgatga 

ttaagtcaga 

acttacagtc 

tccttttcta 

tagtttttcc 

actgtttaaa 

aaataagtat 

ttcaatattc 

tatccaccaa 

tatccatcca 

taagaaagaa 

tgtggatgtt 

ttttctaaga 

tctactgaca 

aagacaaggc 

acgtccatca 

ttaccatgat 

gtgttagtca 

acatttattt 

ttgaggacca 

tagcagcgac 

gagctaatca 

aatatgaact 

agtagcagaa 

tcaacgctta 

atacacatat 

catatataca 

ttttctctct 

ttttttcttg 

agtgttaaca 

cttgtactat 



ctatcaccat 

tcagtgtgag 

agtctactaa 

aaggaatggc 

tttaacaata 

ccctgatata 

cacagctgtc- 

taatgctgaa 

atgggtgtga 

aaagaatgga 

tggtgtgttt 

tggaggtttt 

aatcaagata 

aatattttag 

tccagtttct 

aagataggga 

gcaatatggg 

accaaccatt 

agaagtcctc 

ctctttataa 

tgaccagatg 

ttaaagtatg 

atatcttctg 

taccaaatat 

cctaaaatgt 

gcaattcttc 

tgttcaggga 

gacaaccatc 

gatggattag 

catcaatttg 

tctcctaata 

tactatctct 

acaaaaatga 

cgatgaaatg 

tttccaaatt 

aacaatgtct 

atttttagag 

aattttattt 

ttcatccatt 

ttcacacatc 

gaaaaattta 

tgtaatttta 

cttatacatt 

ttttattggt 

catttgacca 

ggctgaaaaa 

aaaacatatt 

gttctgtcta 

ctcagccagt 

tccttctggc 

ttcctggagt 

cccaaagtct 

ttggggcaca 

atctttcatc 

gagtctgtct 

acacacaaac 

ctgcatctag 

aactcataag 

caaataagag 

atattcacaa 

atttataaca 



ggtctaatca 

actcaaagaa 

agtctgagat 

cacagaggtc 

ggttttcata 

aatattccaa 

tcttgctagc 

acttagtatt 

gaaaagtttc 

aatggctcgt 

tatcctggag 

ttttttttga 

aatctagcct 

aaaaaaatgt 

acactttaaa 

aggttcaagg 

aaggcaggaa 

cctgctcccc 

ttgaattttt 

tattgcaatc 

aataaaaagt 

cttctcagga 

cacctgtttt 

gattggcatt 

agattttttt 

tataaaatta 

aatctctagt 

tgttcaactt 

gtaacttcat 

caizgaagctt 

tgtgctaggt 

tttttcagat 

agtacgctta 

tagcttcatg 

tttagtatag 

ttgatttaat 

ttcaaagggg 

attttttata 

tatctacctg 

cattatttca 

ttcaggcttc 

ttattgccat 

acctctgaaa 

caaagcacat 

ggaaacacag 

atgtggtata 

taagacaaag 

ctctaacaga 

tctgatggct 

tgcacactgc 

gacaagggct 

ccacctccta 

caaatattta 

tcttctaaaa 

tggagagcag 

atttgtttgt 

cacaataggt 

ggagagtttt 

tggaatagaa 

gtgtttaaag 

tgacctttgt 



cctcccacca 

gacacatatc 

caattaattg 

ttgtaggttt 

ggtggtcagg 

ccatggatga 

tggtgtgagt 

taaagttgtt 

agttagttta 

gggtcgcata 

aggagggagc 

cagaagatgg 

taagtattcc 

actgtataaa 

tcctgctggc 

gaaaacttct 

c.tattttacc 

cacatgtcat 

cccagcctgt 

tgttcttatt 

tggccaccaa 

gggcatttta 

cattggttca 

gtgttaaatg 

ttttgctata 

ttttttgtct 

gagaaaaata 

aaaacatttg 

ttcatattta 

atgagtgaga 

gctttatatg 

aaagaaattg 

tacaatctga 

taggggagta 

atgaagaaaa 

atatggtaaa 

aggaaaatat 

atgaaatagt 

tccatctatt 

tcctgcattc 

tatgtagatg 

ttcctttcca 

tgcttcctct 

cacatagtca 

gagatccaga 

aatattaata 

gtttttacta 

ataccataga 

ggaagtctga 

tgacttctcc 

tcaatcccat 

ataccaacac 

gtccataaca 

ctaacactag 

gaacagatgt 

atgtgtgtgc 

aattgctatg 

ttaatcagct 

agattttcgc 

aaataattag 

tctatttata 



agtctcacct 

tgaactatat 

tagatatctg 

gtggcttgtg 

gaaggcctta 

tttactcaca 

cagctccaga 

atcttttgtt 

gttagtattt 

attgagacta 

tggagagtga 

ttcccagtta 

cctttcctat 

gaatgctttc 

cttttacact 

gaggccaagt 

caataactct 

tcatcacact 

cctgtcaaat 

ccactttaaa 

tttgacatta 

agtcatttga 

atatgaaaga 

ttcttaggtt 

aaaaagacat 

aaatcttaaa 

aaatttatag 

tctcaaataa 

ttaaatta.tt 

gatatacaaa 

tatgacattt 

agacttaagg 

atctactgat 

taacatggaa 

agacattctc 

tattaaatat 

cctttgtaag 

cagcactcca 

catccatcca 

atacatccat 

gaagtgtttt 

aatgtatgcc 

tggaaatgat 

-tgcctgagcc 

aatattggtg 

tttaagacac 

agagcaaaaa 

ctgggtggct 

gatcagggtg 

ttgtatgctt 

tcgcgagggc 

cttgaggatt 

gccttggagg 

tggaaaacat 

attatattac 

ttatgtggat 

tcagggcaca 

ggtaatatca 

tgttgagaat 

gttcatttca 

tatacaagta 
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ataggaattt 
tacaatagtt 
gagatcaatt 
tcttgtccta 
ccttagacaa 
ttgtaaaaat 
gaaaacaaat 
tattatttac 
attctaaata 
agatctttgc 
tggttaatac 
aatgcctaaa 
atcaaagagt 
ctaatttatc 
atctgggttc 
cctcagtttg 
actcaataat 
gaatattgac 
tcagtcacgt 
cattctcaca 
tgatgatttt 
caggcacgga 
atgacataat 
tcggagctgg 
cctttggatt 
caaatagaga 
tgcagggcca 
tggactggct 
gaaaacacag 
ctcactactg 
ctaaattggc 
atccagcagc 
ccaaatgaaa 
gaggaaattg 
actccaggga 
gtagacgccc 
tgtctctttt 
taatttgtgc 
tgcttatttt 
gtattcttca 
tgaaatattt 
tgttgcttcc 
taatagaata 
tatcacaact 
agaacagata 
ctaatattga 
ttaattgctg 
cagtttttta 
atttaattgg 
ttccaacctg 
caggtctgct 
ttttaaggca 
ctttcaactt 
ggagtgcagt 
tcctgcctca 
tttttttttt 
tagtctcaat 
caggcgtgag 
tagtttcatt 
attttctatt 
atatttgaat 



ttttgtgaag 

cctcaatgac 

tcctgatgct 

tcatttgatg 

tacagtcact 

tgcttttttt 

tttgggcaag- 

aaatatgaca 

tgatgtgaca 

tatgcagttc 

acacatgtaa 

tattatgttt 

acattaatag 

ctcttataga 

agtaagcacg 

tactggaatc 

gggattcaaa 

tggcacttca 

ggggatgatt 

agtttttaag 

aaaggtaatg 

gtctgcgtca 

gaattatatt 

tgagagtttt 

ccttgatcct 

atgtagagaa 

acatgaaaga 

agtgtcgggg 

tgatgcagat 

tgatcataca 

aggctttttt 

aaataacatg 

atgagtcaag 

tgagcagggt 

aaaatgagaa 

gttacggttt 

gagtttataa 

ttttttctat 

ctaatatagg 

aattttacct 

cctaatttcc 

caaatatttg 

ccattatagt 

tgctttgtgg 

aagtgtattc 

tattgttgaa 

aaaaagcatg 

aattgttaca 

aaatttttgt 

gtatttttcc 

ccagacgatt 

ttgtctcagg 

tttttttttt 

ggcgcgatct 

gcctcccgag 

tttttttttg 

ctcctgacct 

ccactgcacc 

catcttatga 

catgtttggt 

gtatttactg 



aagaaacact 
cttaccttgg 
cagtttatat 
gaaacgattt 
tcaaacagtt 
ttttttggtt 
aatagtggca 
tcaaacatta 
aagttaactc 
tgaacactgc 
ctctacaact 
gtagttgtcc 
aaaattaacc 
aaaatatttt 
tgtttcgagc 
attaatgcat 
tgctgccctg 
gaagtcacca 
tttttggcaa 
cccaggacag 
tcagcctgag 
tcttagtccc 
taacttttcc 
aataaagtgc 
cctctgcact 
gtaaagtgct 
agggcagaaa 
tttcagccag 
caatatatac 
tttgggcttc 
cctaacaact 
catgtccaac 
cattcttttt 
acaagaagct 
gtgttgaaca 
ctccatgtat 
gttcccatca 
ctttaagata 
catttaaaga 
ttcaaaattt 
cttgtcagtt 
ggaattttta 
cagagaaaaa 
ttgagatggt 
ttcaattgtt 
gtcttctagg 
tttaatgtct 
tttatgtagt 
tgttttaaaa 
tcttcagcct 
cagatttttc 
atatttgcta 
tttttttttt 
cggcteactg 
tagctgggac 
tatgtttagt 
tgtgatctgc 
cggccatctt 
gcacaatgcg 
tttcatcacc 
tttcttatat 



tgataatatc 
gaaaaaatta 
tcatcatgct 
ccatgtgttt 
caactctaca 
cagagtattc 
tacacattac 
gtggttttat 
cgaatttcct 
catgctttca 
tgatcttagc 
ggagcatctt 
aaaactgaat 
cagtgtttat 
atactttgag 
aatctgaagt 
tttgatctgc 
gagtattctg 
agagcatttt 
tctccattcc 
aaacagtgaa 
taactctcac 
tgctgctcat 
ttagcatcac 
ttatgtaatg 
tcatgcaaga 
ggttccggac 
aaccttcctt 
cagttttgtt 
ccaacttttt 
gtgaaatgct 
ggcgaggatg 
tctttaagca 
gcacatgcct 
agcagcactc 
taaattggtt 
atattttatt 
aaagcataga 
tgaaagtttc 
tatagggttt 
ccacttttac 
cagatatatt 
tgtcatgtat 
gatctacctg 
gaatataatg 
tccttattaa 
gtgattggag 
ttaaagcttt 
atgtttaccc 
gaagaagtta 
tggtgaacat 
tcatgtttac 
gagacggagt 
caagctccgc 
tacaggcgcc 
ggagacgggg 
ccgccttggc 
tcaacaattt 
tgttttttat 
gatttgatga 
atgtgagttg 



ccaattccct 
aatgaaagac 
aagctatcca 
tatgattcaa 
gttggctggc 
tatattttac 
agagtgattt 
ttcattttta 
ggtttttgtc 
acttttcaac 
atttcatgta 
attttcactt 
aagaataaaa 
atttatgtga 
atgcattgtt 
tgtgattatt 
attaactgtt 
tagccttcag 
gcactctgag 
accacttgta 
atatgctgaa 
cttagtgaat. 
gttttgatat 
ctgagggtaa 
agtgaccttt 
gggcatcact 
acaaagggca 
tagaatttgg 
acagatgttg 
ctcttgaatt 
agtctttaga 
gctccaaaaa 
gagcaagatc 
aggaggtgag 
ccagaaaata 
tataacattg 
tcctttactt 
tgataactga 
ctcttacaac 
atttttatta 
atacagatta 
atactattgt 
tatttcaatc 
agggaatgtt 
ttctatatgt 
ttttttatct 
tcttttctgt 
gtgattatat 
tttctggggc 
cctcagcatt 
ataaaaatat 
agtcttttat 
ctcactctgt 
ctcccgggtt 
cgccaccgct 
tttcacagtg 
ctcccaaagt 
aaaaacattt 
ctcctccagc 
tcgtttgcct 
atatatttca 



aacagaatta 
acctgcatta 
aggacacctt 
ctccattctg 
tgactttgcc 
ctaagaagtg 
cacatttcaa 
tatcagaggc 
agtttaaccc 
aagaaacagg 
atactttggt 
tttcaattct 
cctcacttcc 
taccagccgc 
ccttggcgca 
gagatttttc 
tggaattcca 
agggtaaatt 
tcatcacaca 
ggtaaactgg 
gatatggaca 
tgttccaaca 
ttgtcatcaa 
atttctatat 
ctatctagtt 
gatgtcaaaa 
gggcactgcc 
aaactatgga 
aagaaggtgg 
aaacatatta 
aattttcagc 
agagagaacc 
attcttatca 
agttctgtct 
acacattgta 
tctttttgtt 
attttgggtt 
ttttagactt 
tgatttagct 
ttgttcaatt 
tttaaaatta 
ttgtgatttc 
cttttaaatt 
ttatatgcat 
gtcatttaga 
attagttata 
aattctcctt 
ttatccacat 
tttttcttcc 
ttttatagaa 
acagaaatgc 
cttctactat 
cgcccaggct 
catgccattc 
cccggctaat 
ttagccagga 
gctgggatta 
tatggtcttc 
ctctttcaag 
agttcttgtt 
tctaatttag 
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aaaattgtca 
acaagaccct 
tctccttttc 
tcacccaggc 
tcaagtgatt 
gcccagctaa 
cgaactcctg 
catggctctc 
ctccctcctt 
gtttactgat 
catctcaata 
ttctctgcta 
tctgacatat 
tggaactgcc 
cctataatgt 
ttaacagggc 
ccagctgctg 
acccagcctt 
cttgagtgga 
tctacccagc 
ctgctcccag 
ccaaacttgg 
ggtgggtgca 
aagcttttta 
cctcctcctc 
ggaagagatc 
ttcattggct 
tcatttgtag 
tgtttataac 
gaccagaaaa 
cactgaggct 
tttgccttcc 
aacacctgta 
tcgagatcag 
aggtct ggtg 
ttgaacctgg 
gcagcagagc 
tggaagaagt 
gaatctttat 
ttttaaaaca 
ggttttcaag 
gataattatt 
gcaaggaaag 
caaagaattt 
tctcaatatg 
tttaaacaaa 
gattagcaac 
gcctacattc 
aagcaaaaag 
aactgggtca 
tgttaggcca 
atcagttgtt 
ttatttgaaa 
tgttgccttt 
cagtaatatt 
taatttaagt 
tgtaaaatta 
gaagtattgt 
aaaatataag 
agaaaaatta 
ttacaatctt 



cccatgttct 

taacgtaaat 

gtttttattt 

tggagtgcag 

ctgctgcctc 

ttttgtattt 

acctcagatg- 

ctttccttct. 

ttttttttct 

tcttttcttt 

ttttattttt 

aaattcttca 

ctattatatt 

tctatttctc 

atatatatat 

atgagatcta 

actccctgag 

ttcacaactc 

ctcagtgaag 

tttccgcaac 

cctcaagtca 

gtgcattatg 

gagctgcctc 

aaatgtgttg 

ttcccatcac 

atgactttct 

cttataaact 

ttctcttatg 

atggtccatt 

aaattgtaat 

gtgttgtttt 

aggggtatca 

atctcagcat 

cctggccaac 

gcacacgcct 

gaggtggagg 

aaaactccat 

ttatgaggta 

gtcttatttc 

tttgtagatc 

gaatttatat 

tgttaatgta 

acccaaagaa 

ggtgatatga 

aattaggaaa 

tgatgagtct 

aatggtgtca 

aaaacaagaa 

ccttcctaga 

tgtgtccgtg 

attgtatttc 

atcactctac 

agttatttga 

aaaatgataa 

aactattcta 

aaggaaaagt 

tcaatgtttc 

taacatccgt 

cacatttcaa 

ccaaatagca 

ataataataa 



cttcaaaaat 
atatttgatg 
tattttattt 
tggtgcaatc 
agcctcccga 
taatagagat 
atacacccac 
tctcttctgt 
tatatttcaa 
ggtatgccag 
tttccctatt 
ttttttttct 
taatataaaa 
tttcttccct 
agcttcttca 
agcaagggag 
tccgggcttc 
agatttctac 
tccccgggag 
agctttttgg 
gctatataag 
tccatgtact 
tgatggagct 
aaaattggac 
tccacctagc 
agattttctt 
gtgttttgtt 
gctttctaat 
tgcagtttat 
ggcagatgtt 
taatcttact 
tccatataaa 
tttgggagat 
atggtgaaac 
gtagtcccag 
ttgcggtcag 
ctcaaaaaat 
tcctaagaca 
gtccacagct 
aagctatctg 
taagattgat 
gatatttcat 
agcatgtaag 
taaaatccaa 
gccttcagct 
tttctttttc 
aggactgaaa 
gagggtagga 
agcctcttgg 
tctaggtgtg 
atctcttggg 
ctaagtgttc 
tattcaatag 
atagcaattt 
ttctcataaa 
agagtttatt 
tttcttgaac 
ggaacttaga 
atttatgtga 
tattgtatta 
cagataattg 



gtcttctacc 
gcataatatt 
tattttattt 
tcggctcact 
gtagctggga 
ggggtttcac 
cttggcctgc 
atttttctcc 
tgtgggtgat 
tttactaata 
cttgacattt 
gaaaaattgt 
ctcctctctg 
tgcccatatg 
tgtcttgtaa 
agcctcggat 
cagtgaattc 
gctttgctgc 
ccttggaggt 
cttagagggt 
cctccatttt 
aaagtaaagg 
tcctcaggga 
ttgtttctca 
atcagggaag 
tcagttaggc 
attgttgttt 
tcctaaccag 
aggggtggta 
atgttgtctt 
gtgcatggca 
taaactgctg 
gaggcaggtg 
tcgtctctac 
ctactcagaa 
ccaacgttgc 
aaaagtgaat 
tgtacaaata 
ccccacccac 
gttctaaaac 
tttcaaatcc 
agagacttca 
tagatatcct 
tttctaaata 
acacgtaaaa 
cttaaatctc 
gctagtgatt 
tacagcgtca 
cagtcatata 
agagagggat 
aatgggggga 
ctctgaaata 
ttataatggt 
aatatctaga 
tcttgtatca 
tcccagtcac 
cactgcacct 
aaaaatacca 
attttaaata 
agaatgaaat 
tatacatgca 



tcattatctg 
ttctacacat 
ttgagaggag 
gaaacctcta 
ctacaggtgc 
catgttggtc 
caaagtactg 
ttcttcctct 
ttctattgat 
gacctgtcca 
ggctttttag 
ctaaaaatag 
ataatttcaa 
gcatagttat 
ttttttacga 
ttctctttgg 
tctttgctgt 
tcagctctga 
gtttcccact 
tttctcatgg 
tcagatcccc 
ccccctggga 
cctgagtcat 
catccctttg 
cacttcattc 
tttttttgca 
tcaagttatc 
aagtggaaac 
tatgaatcaa 
aagccttgct 
ccttctcatc 
gaaggccagg 
gatcacttga 
taaaaataca 
ggctgaggct 
tccactgcac 
aaaaaataaa 
ctagtgtaag 
accatggatt 
atttgtagat 
tgtgtgctct 
ggcaaacaca 
aaatcaaatt 
atgagttttt 
gaaaatctaa 
ctggaatcct 
gcttttgctt 
ttcattctgc 
cttaacacct 
ggaaggtggg 
ttgatattgg 
tattacttga 
aatccaatac 
gtaatctcat 
tttttttgtt 
aaatagaagt 
tgaattttga 
gtgctaacca 
atctgttttc 
gttctgtaat 
atattaaagt 



tcctcttttt 
ctcagatggc 
tctcactctg 
cctcctgggt 
ctgccaccat 
aggctggtct 
ggattacagg 
ccatctctcc 
ctatcttctt 
attaattccc 
ttgtgtttat 
tgttagatta 
catctatatt 
gtttatacat 
tcccttccat 
gctttataac 
ctggttgctc 
gcctttgagg 
gctgtcctac 
ctctcctgag 
tactgtgccc 
cagagaggtg 
cacacttcat 
ctggcaaatc 
ctctctcata 
tctccagctc 
catcttgttc 
tgctagctat 
gtgtggaaag 
ttgcttttgc 
acactagtgt 
cgtggtggct 
ggtcaggact 
aaaatgagcc 
ggagaatcac 
ttcagcctgg 
aataaactgc 
acccaactgg 
taagaaagtc 
caagctatct 
agtctaggag 
tgaatgagga 
gttatttgac 
ctgttaaagc 
ctcatagagt 
gttgacattg 
tagacttcat 
atcatcagga 
ttttggcaga 
aacatggagt 
gtagacaaaa 
ttctggaact 
aaaatatata 
ctacatttga 
aaattgtcat 
aaccggttgt 
ttaaagagtc 
catcacatct 
tttggcttgt 
caagatcaaa 
atatgagcat 



75120 
75180 
75240 
75300 
75360 
75420. 
75480 
75540 
75600 
75 660 
75720 
75780 
75840 
75900 
75960 
76020 
76080 
76140 
76200 
76260 
76320 
76380 
76440 
76500 
76560 
76620 
76680 
76740 
76800 
76860 
76920 
76980 
77040 
77100 
77160 
77220 
77280 
77340 
77400 
77460 
77520 
77580 
77640 
77700 
77760 
77820 
77880 
77940 
78000 
78060 
78120 
78180 
78240 
78300 
78360 
78 420 
78480 
78540 
78600 
78660 
78720 
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attctttaga 
gttcactgaa 
tttcttttac 
ttgaaaaatc 
gaattagaag 
tggtttgctt 
caggatgaca 
ggatgtgtgt 
tcttggcagc 
ggggtaagtc 
acaacctagt 
ggtttctttt 
tcatgtgtgt 
ctagggtgac 
gataagacag 
gcctccaaac 
aacatttagg 
gtgtgtgtgt 
atgtagatcc 
ttgtcacaat 
taatgtaaaa 
ttaagctgtt 
tagatatttt 
aatagatttt 
gaatgcttgt 
cttcttcttt 
aaaaatctga 
tataatattt 
acacatggtc 
tctagtggcc 
atgagcatta 
tatgaacaat 
gacagaaaga 
accttgagga 
ttgattttga 
ctcaaattta 
cttctacaca 
cataaaatta 
cccttccctc 
agagaaacta 
tcaggaaatc 
atgtttgtgt 
aaaaaaaaaa 
tcattgcaat 
tctatttagg 
ctattaggac 
aatatgcagt 
tataatactg 
gaaaaatgtt 
cagagctcaa 
atgtccttca 
tgctaggttt 
atatttgata 
ttgacctggg 
tgcaacacag 
gctcctctcc 
gctcagaggg 
caccattcac 
ttactgttgc 
cattgcataa 
taaaatatat 



caaaatatcc 

taaatgaaaa 

ttcagtatga 

tggcataatt 

agttggtctc 

ttattactgt 

tttctatcac- 

gtatttttgg 

ctccttggta 

tcagtcatcc 

tctgtcaagt 

ggcaaaagaa 

acgtaagaac 

ctggtgagaa 

aatacatttt 

caatataatt 

gtttttttcc 

gtgtgtgtgt 

tagtggtgat 

aaatttatat 

gagttaaata 

aggcatgaag 

tattatttgg 

aatcaatcac 

aaagcatttt 

taatcgtttc 

attcatataa 

taatgtcaat 

gttcttaagc 

ccgagctgca 

acccaggcat 

tcacagtgct 

gaaacaaaat 

gactgctact 

catagccttg 

aacacatttc 

ctaaaccttt 

gaaggggttt 

ttaccatccc 

atcaaaagcc 

atcatcatta 

attcagatcc 

acgtgcaaag 

aataaattat 

aaagcgatct 

ttagtaatga 

gtattaaatc 

aaccagttgt 

taaagctctc 

aggctgctct 

gtttcacctt 

taactaaata 

aagggaaaat 

tggcactcca 

agtaaggtcc 

attgctcacc 

taggagctga 

cccaacaggt 

tgtgaagttt 

ttcctgtact 

taatagaaca 



aaagacatca 

tattaagaat 

aaagcatgca 

taaaagcaac 

tgtttcttga 

tttcacttct 

attatggctt 

agacggtatt 

taaccacgca 

gaagccagtc 

ggaaagtgaa 

agagaggggg 

tggaagttct 

cagacaaccc 

ttctcactgt 

agccgtatgt 

cagtgatcaa 

gtgtgtgtga 

cttaaatgtt 

ggagagatat 

atatagttct 

gaaaataatg 

tattgtcatg 

tctctttatt 

ctaccaatat 

taaaataaca 

ttttacctaa 

gtgtgtacat 

gacagagaca 

ccccataaac 

tcgaaggtcc 

ataatttttc 

ataggtgcac 

tggggacatg 

gtgttgggaa 

cttttctgta 

gagccttttc 

tgatgtcaga 

atacagatga 

tccaacagtc 

tcatcaaata 

taaagagggc 

gtacacaagc 

ttttaaaaga 

tataattaaa 

ctatcaaact 

tgaaacaaat 

ctctgttaca 

ttctctccca 

caaatgttca 

ccagtcaagg 

aagcaattta 

ttgccagtcg 

gtctggactt 

cacgtcatcc 

ttcacgagaa 

cccagcacct 

gggaaacaat 

tgatttggcc 

atagtagtag 

tacaatggta 



atgttttcat 
ttggttatat 
aaaatgataa 
tttccatctg 
tgtgaatgat 
tttcttggta 
aacaaataaa 
tgttgcctac 
tttacggaag 
ttggtaatcc 
gagagctgtg 
gaacagttct 
catgcccact 
ttgagccacc 
taaactcttt 
aaatattctg 
aaatacagca 
tattattaca 
gagtatttga 
tttaattata 
gttgataatg 
ttgtaagtaa 
ttgtttcatg 
ccaaaggttt 
attaggacca 
tttccaaaaa 
tgtggaagat 
ctgtaattaa 
tagaaatgta 
tctttgatag 
agataaatca 
ctaagttaag 
tctcctctgc 
tactgattca 
attcttccct 
atcagtgagg 
ttcagaataa 
ccatcctttt 
gcaaaaccat 
acacattcct 
gcattttcct 
cctagaggcc 
atcactaggc 
cattccagaa 
gaatatatcc 
caagtctaat 
ccatactaag 
aagtgtgtat 
ttctggctcc 
aaactcccca 
aattcccaat 
attgaaggta 
aacatgctgc 
caccttacct 
attcactcct 
gttccctgta 
gcagatcctg 
ttgatacaca 
tcctaaagta 
gtactatcta 
ttggcataca 



tattaaagag 
taagaataaa 
ctacattttg 
gctggtgcag 
tgtgattagt 
tatgtgtgtg 
acagaaaata 
tcaacagtgt 
atgacgttac 
cagcagacta 
ctgggaggct 
ttttcttctg 
gagcatgggg 
ttaccctagg 
tttattattt 
aaatctctgt 
tctcaagcaa 
taataattgg 
ctttttttag 
gaaactcatt 
atattttgta 
tcagttttat 
cctcataata 
aagcagataa 
tttctggata 
cctcaagcaa 
tgggagagca 
taaaattagt 
acagcacatt 
ctcttaacaa 
ccatgctctt 
actagactaa 
tcattatagg 
ttttctgact 
caaagcctca 
gaagagcagc 
atgatcctac 
tctgccttta 
cctattttta 
ctgatgagca 
aacacccaca 
cagaaaaggt 
tctttgcaag 
actccagggg 
attcattttc 
gaccaaagat 
gttttaaaat 
gtattgaggg 
tcctttatgc 
cccatggctg 
ttctgagtat 
agccttccct 
ttgatgcggt 
ttaattgctc 
tcctcttttt 
accgtgaagg 
aagggcccct 
ggctcaaaag 
gaggagccaa 
gcaagacttt 
gctcttttta 



ttactttcca 
ctctaagatg 
ttatgatggt 
taagcaagtg 
ctgatttctt 
gatatgcata 
tgtttacttt 
tctttttact 
cagcagccct 
gcaagggaac 
tctcagaaaa 
ccggacaagg 
agagtcctgt 
aactcctggt 
gtagctgggt 
caattcaaat 
tattgtgtgt 
aagacttttg 
catttaataa 
agtttgtcca 
atagatattt 
atagggttta 
gctgattaca 
agaatacaga 
aggtccttga 
aaccacttta 
gtgactttaa 
acaattcaca 
aaaatcctct 
atgcaaatta 
caggaattca 
tgttgagaga 
gttctcttcg 
ccttttgtcc 
atcgctcatg 
tggtcaccat 
ttcttagaat 
gacaagacta 
aaagagatca 
acccttactt 
tgtgcaaaat 
tggaaaaaga 
agagtctagt 
attttcattt 
ttttgtttaa 
tgagaacata 
ttgcaacacg 
gaattgtctg 
tcctggagaa 
gaatggctct 
tactgatttt 
taatttataa 
gggggggatt 
tgccagggga 
gggtttctca 
ttatcacaag 
aatctaaata 
tttgttttga 
tttgatattt 
actgcaaaca 
ttttttatct 



78780 

78840 

78900 

78960 

79020 

79080 

79140 

79200 

79260 

79320 

79380 

79440 

79500 

79560 

79620 

79680 

79740 

79800 

79860 

79920 

79980 

80O4O 

80100 

80160 

80220 

80280 

80340 

80400 

80460 

80520 

80580 

8064O • 

80700 

80760 

80820 

8088O 

8094O 

81000 

81060 

81120 

81180 

81240 

81300 

81360 

81420 

81480 

81540 

81600 

81660 

81720 

81780 

81840 

81900 

81960 

82020 

82080 

82140 

82200 

82260 

82320 

82380 
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tttctaatag 
ccaatgaata 
gtattattga 
gctatcacat 
ttaaagaagt 
ggagtgcata 
aatgttaggt 
tcctattgta 
gatctcgtat 
cttttaagat 
ctaaaagctg 
tacatggaaa 
tgtcccagca 
gccaggcctg 
aagcacaggc 
ctgtttcaga 
attgtaggcc 
tttaggatgt 
ttgaaataag 
ttctttgtat 
atgacaatac 
attattacat 
cttaatgtaa 
atcaatattt 
gaagtatatt 
gttcatgatt 
agtattcaaa 
atgaatatat 
atattgtcat 
tctttgaaaa 
caatttgtct 
aatcctatta 
tcattgtaaa 
cactaaatgt 
aacagtcaat 
ccgatgctag 
gaaactggaa 
agtaacacag 
ttatacccac 
aggtaaggtt 
aaccctcctt 
attttattaa 
aaagagttat 
tccaatacag 
gacctttcgt 
attcttctat 
agaagaacaa 
gaaggagaag 
gaaacagaaa 
gcactttact 
acatttgtgt 
tcatttccca 
tgagcatgaa 
tataattatg 
atatttctag 
gtttgttcac 
aacaatattt 
attagggttt 
aagctgtcct 
agtactccag 
agaatcaatg 



atatgtttat 

acatgcttaa 

atatctcata 

ttaaacctaa 

tgagcctcct 

tgtgcacaat 

ttctagaatt" 

atatgcagaa 

ttccacagag 

tttccgatgt 

cacttaatag 

tattacttcc 

gcatccagat 

agctcaagtc 

gccagctgcc 

ttgcaaagga 

tccataaaca 

aaaggttaaa 

acatgcggaa 

gggtaagtag 

gccataatca 

tgcagagata 

tacacttcta 

ttgattcata 

gaataattca 

aggcaatttg 

agaaactggt 

aggaagttct 

tttgattgta 

cagtttcaca 

aagtatttga 

atgagcataa 

ttcaggtagc 

caatcaaatt 

tctgtttcat 

aaccatcctt 

ccctaggttt 

aaattctgtg 

accaacaaaa 

ttacgattca 

taaagggatt 

tcaaatccat 

ttcacaagca 

gtttaatcct 

gaaacatgtt 

acaatgacac 

aaaggtcact 

aggccattta 

aacatgcctt 

gttccgttat 

tataattatt 

aagtcttgca 

atctgttttt 

aattgaagac 

aaatgatttt 

ttaaatgcaa 

tccaagatta 

cttaatcttg 

gtgcattgtg 

gttgtgacat 

gtctgtatta 



tctatttcct 
atattttagg 
tttgaaagga 
ttacatttca 
ttacgataaa 
tacataaata 
tatccaaaca 
gtttcaattt 
ctactctgtc 
ttatctgttc 
cttacttacc 
ctctgtgcca 
tttatgaata 
tcaaggctga 
ccacctccat 
gggtctctga 
aaaataaaat 
tatatatgta 
gcaaaattca 
tatggctaag 
ttgtccttat 
tgagacttaa 
tcaggaatag 
tgaatcatta 
gaattaatta 
gaaaagacta 
aagttcatga 
tgtctttgag 
tttcaatttc 
aattaagaac 
ggtaattttc 
atggctcccc 
atccatagtt 
catcagcatg 
tgttatgtaa 
gaaacagatt 
ttcttaggca 
gtcagctagt 
ttgtatctaa 
gtgccttagt 
taccatcacc 
aaggcagtca 
gatgaagact 
gaattttccc 
gctcagaagt 
atacctggcc 
cttaccattc 
tcattatagc 
taacatagca 
ttcaattaac 
tcctttgaaa 
gctgataagt 
tttttccatt 
taacctttat 
taagtgattg 
accctctgag 
tttgaactta 
gcatcattgg 
ggatttgggg 
cctcaaatat 
atttaactgc 



aaacgtattc 
gaaatggtaa 
aagaaagtga 
agacaagaat 
ctgaactcaa 
atttggttta 
tcatgatgat 
taagatataa 
attaagtgta 
tctgtatcag 
gaattagtaa 
cagactgggc 
ccatttccga 
tgttgttaat 
ggcacagccg 
cttctgtttg 
gaaatcgaat 
tctatgaaga 
tttttctttg 
tagtatgttt 
ctattaacta 
gacttactca 
tacagaattt 
tttggatcag 
gtttccattt 
tataaaataa 
gtataattgt 
aatacatggt 
gttatgtgta 
tttaaacaag 
atcctcatct 
aagtgtaaat 
gatgacagaa 
atacttttac 
atttatgctg 
agatgattgg 
cttcttaaat 
ataatctttt 
aacattggta 
ccccttaagt 
tcttgtaaaa 
atacattact 
acatatgtaa 
attttgaaag 
gacatcaaac 
ttcaaaatag 
ttgaggattt 
ctttttgatg 
gctgatatta 
actcataatg 
gctgaggaca 
gatggttcta 
ccattgaagt 
ttaaattagt. 
atgagtaaaa 
ttaaaatttt 
gatacaatta 
cttttaagtg 
catcctgccc 
ctttagaaat 
aaagacaagt 



aagacttaaa 
aatggtaaac 
cacttgctgc 
atttatcatc 
atgtaagcat 
agaatcaaca 
tgactattta 
tcgttatgtt 
cctaaattct 
tctcgatcac 
tttgggaatt 
tttgattact 
tgctttgctt 
atcaggacag 
tcatggagcc 
cttttgtgct 
tacaacttct 
aatgcatgtt 
tatgggtaag 
tctttgtatg 
aacctcaatg 
aaaaatttta 
tatctgagat 
attttgatcc 
aatcagttca 
tattctattt 
ccaagtgagc 
ttaagtgaat 
aacggaagca 
cttttatctt 
ttt'tatttgt 
atttgaaggg 
tgacgggaag 
atgtctgcac 
atactcaatg 
attcttttta 
atttatggtt 
tcaagcagtg 
tctaacctag 
catttcggcc 
actggatttg 
tcttttggaa 
ctctacttcc 
gcttgccaag 
gtgatcctaa 
agtaaactaa 
cgaaaatcaa 
tcaaagaaga 
gtagctttag 
ttgttgttca 
ggagggtcaa 
ggagtctatc 
gtgtatttac 
gcaatcagac 
tgaagggtaa 
ctcaataatt 
ctgaatttag 
gaataattat 
tctacttacc 
tgccaaatca 
gctgtttcta 



attttcacta 
agctctatta 
tttctcatac 
tttttttctg 
ctctaattct 
aaactaaata 
ttgccgattt 
ctccttattt 
tctccatagc 
tgtcttcact 
accgtatagt 
tcgatttcaa 
aaagttaggg 
gcgggttcaa 
agccacaggt 
ctaggaagac 
ttttaaccta 
aaaaagtggt 
tagtatgttt 
ggtaagtagt 
agaaagataa 
aaggctattc 
atttgtatta 
aaataaatgt 
atccttctga 
gaactatgga 
tctgtaatct 
tctattatct 
atggatgtac 
ctatttaggt 
tatcttgtcc 
aaagattggg 
aaactttcct 
agctggaaaa 
taatcaacaa 
atataattct 
atccagcata 
tatgctattc 
gaaaaatctc 
acagccttaa 
agcaagatgc 
tgaaaaagga 
ctcggtaggc 
tcagttctca 
tttttgacag 
aactcactga 
tgatcctaca 
aaaagaaaaa 
aatttacaaa 
gccagtctta 
ataacttgta 
tccgaaagat 
taactgggta 
tatatccaga 
aatatttttt 
atgtagttca 
gcatatatac 
ttgttttagg 
agatgctaac 
tacttaggtg 
aatattacac 



82440 
82500 
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8304O 
83100 
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85140 
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85560 
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85680 
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8580O 
85860 
85920 
85980 
86040 
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tggacaattc 

tgtagtggat 

ccaaaactga 

tgaaaatttt 

tctctcacat 

tcttttgagg 

aaacagtgtg 

ttcctcttcc 

tgtgttgcca 

ttgatctctg 

ggatacattt 

ttgttgtttg 

gtgcaatctc 

cctcccaagt 

tagtagagat 

tctgtccttt 

acatatatca 

attgaaagat 

aagcatattg 

atagacatca 

tgaaggcttc 

tgggtgttaa 

ggaaatggaa 

tcaagtatgt 

tatattatat 

ataaattgct 

atatctaccc 

ttagtagata 

accccaacac 

tctgagagta 

agggtggctt 

tggccaaaag 

tttcttcggg 

gcccaagtat 

ttcaggatgc 

ccagtcggt'g 

ctgattccca 

ttgttcggtt 

ggagaataaa 

tcccagcact 

cctggctaac 

tggcgggtgc 

gggaagcaga 

agtgaaactc 

ataaaagaag 

attactcatc 

gtcaggtttg 

tttatagata 

atatatatta 

ctgtggtatc 

attttaagga 

aatgtagaaa 

aaataaagta 

gtgtctgagg 

aattctgtaa 

aagtcaggaa 

tcagtgctat 

aataaacaga 

tgtagcattt 

ccctaagctg 

aaataacctt 



taaaggttta 

atactggtac 

cacattccag 

cttagcttat 

ttttgcacat 

atgtttatat 

caaacatactf 

cctataaatt 

tgtgggaaca 

tcccaggagt 

tttgttttgc 

tttgtttgtt 

agctcactgc 

agctgggatt 

ggggtttcac 

gaaatgaatg 

ttttagtata 

tattcctgag 

tctttttaaa 

caaaagttgc 

cttttttaag 

ccttaccact 

ataagatata 

atagtttcat 

atatgtatgc 

attattagtg 

tacatgaaga 

acttattgac 

aaagcttggg 

gcactccagg 

gatactagtt 

tgttgcttac 

aaaacatgcg 

agagtgactt 

gtggactcca 

aaaaccaccc 

tacactgaat 

ggttgtttag 

ttgattaaaa 

ttgggaggcc 

atagtgaaac 

ctgtaatccc 

ggttgcagtg 

catcacacac 

cacgatcatg 

tatgtttcat 

tttaatagtt 

cttagaatga 

tctctagcct 

tgactaaaaa 

agcacttcca 

aattgtcaat 

aaataaaaaa 

tcactgaaga 

gtctgaaata 

acagtctctt 

ggcattgctg 

aaaagggtca 

tcttgagaaa 

ataaatactt 

tgtgaatcaa 



ccataaatta 

aggacatagc 

gcatcccttc 

tttgcatctt 

cttgcaagtc 

agcaaacatc 

tactccccat 

cct'ttgcccc 

aaatgttgat 

cttctcattg 

cagaaaggca 

tgtttttatg 

aacttctgcc 

acaggcgtgc 

catgttagac 

acaaattaca 

tccttataga 

gatgtcatcc 

ggaaagctga 

gagttactgc 

ccaacctcta 

gatttattac 

tattaaatat 

gccaacctcc 

gccaacatct 

gagtgggggg 

agatcacatg 

acagggatcc 

aaagtgatgg 

agcatttgga 

ttcaacacaa 

tctggaaatc 

taaatgattg 

cactcttctc 

tattccacaa 

tctccacctt 

tgaccctgga 

ttgattgaga 

tttcttaaaa 

gaggtgggtg 

ccagtctcta 

agctgctcgg 

agctgaggtc 

acacacacac 

cagagcgtta 

tattttatgt 

aaattcatct 

cattaaagat 

aaaatggaat 

aaattatcat 

tggaccataa 

attgtcagtt 

atgatcccat 

aactgggtcg 

tatggtattt 

ggcttgacgt 

atcacaatat 

gtcatttcag 

cttttaagtt 

aatgaatttt 

cttgtataaa 



gcagaggcat 
tatctaagct 
taatcttcct 
cccagcatat 
aggcactgac 
cttggaaatg 
tatagaacat 
ttgtaaatta 
actctggata 
cttcctacta 
aaatccccag 
cagcctcggc 
tcctggattc 
accaccacgc 
aggctggtct 
tgctatttat 
ttatctcaca 
actctatcac 
tttttaccta 
atttgcatcc 
gcagctgcat 
ttttctactt 
aggtgtgtgt 
atattctata 
taaaaagttt 
aagaatcaca 
atctttggtg 
cagagatcct 
aatggactag 
gaattcacaa 
gctctggccg 
cctaagcact 
aatataaatt 
tggggactga 
cattgtgtgg 
ctccactcta 
attcatggac 
gaagtgaggg 
ggggccgggt 

gatcacctga 
ctaaaaatac 
gaggctgagg 
acgccattgc 
acacacacac 
tgctggctac 
gagtttttag 
acaatcaatc 
tttggtagct 
gttgtcattt 
agtatagatt 
tctcagagtt 
attaatgatt 
ttaactggta 
gttaccatat 
tcaatgcttt 
ttgagaggca 
atctatgaca 
agaggtgatt 
tttctttgat 
cagtaatagt 
tcacaacaca 



taatctttta 
aattctatat 
ttgactcctc 
cttcactggg 
cacctttttg 
aagaatcatt 
tcaggttccc 
ttgttttaca 
ctgctgttac 
gaattcatgt 
tccttctcat 
gctctgtcac 
aagccattcc 
caggctaatt 
cgaactgctg 
gccccaaaat 
tgaaatttct 
ttattgtttc 
aacaacagtt 
tgcctatctc 
aatatcattg 
ttaaataatt 
atatatatat 
tataaatata 
acatagaaag 
tgagcaatct 
tttctcattg 
cagggaggaa 
agtggcactt 
ggatggtaac 
tcctaagaaa 
ggccattctt 
caaagaattt 
agtggaaaga 
cagcatcaga 
tctttctata 
attgtggttt 
gaaatgataa 
atggtggctc 
gatcagaagt 
aaaaaactag 
caagagaatt 
actccagcct 
acaaaattaa 
ttttacattt 
atctacttag 
ttactttatg 
aaatctggcc 
atgagttcat 
gctatttttt 
ggacaagagt 
tccagtgata 
cttctaggaa 
ggtgtaatct 
ttttttttga 
gtgcaaaaag 
ttctgataca 
ccattataaa 
ttttctctaa 
agacaacttc 
aatagtgaaa 



atgatagtat 
atacagtaga 
ttatctagag 
agaaaattcg 
ttctggattc 
tctgcctgtg 
taagttcagg 
tgatcctctt 
taaacttttc 
atccatatta 
tactatgttt 
tcaggttgga 
tctgcctcag 
tttgtatttt 
accttgaaaa 
gctaacttcc 
ttttgcaccc 
ctctttatat 
tcatttcttt 
ttttaggaaa 
atagtaatgt 
gggtatctga 
atatatatat 
tattacatat 
acatcttgga 
aggtagcaat 
agtatccggg 
agtgattgtg 
gacaaactat 
attaataaca 
aggctttttc 
gcatgaagtc 
taattgttga 
ccaagtgtcc 
agccactgcc 
aacaggatga 
ttgtttgttt 
tatcagatgg 
acacctgtaa 
tcaagaccag 
cagggcatgg 
gcttgaacct 
gggcaacaag 
ttaaaaggta 
ttacttttac 
gcatttgtag 
catatatgta 
agtacaaaaa 
tgttacctga 
agaatgaatt 
tctaaaacac 
taacacaact 
attcaggtag 
aatctctttc 
gtattacatt 
ccttctactt 
agatctgaga 
ggtgggtatg 
gttttaagag 
cattagtatt 
acaaaacaaa 



8610O 
86160 
86220 
86280 
86340 
86400 
86460 
86520 
86580 
86640 
86700 
86760 
86820 
86880 
86940 
87000 
87060 
87120 
87180 
87240 
8730O 
87360 
87420 
87480 
87540 
87600 
87660 
87720 
87780 
87840 
87900 
87960 
88020 
88080 
88140 
88200 
88260 
88320 
88380 
88440 
88500 
88560 
88620 
88680 
88740 
88800 
88860 
88920 
88980 
89040 
89100 
89160 
89220 
89280 
89340 
89400 
89460 
89520 
89580 
89640 
89700 
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gcaacaaaaa 
gtgaccatta 
atcaattgaa 
ttttgattct 
gatggctctg 
tacagtcctt 
atgtgcaatt 
ctctcagttc 
aggacagcag 
gctctctaag 
ccagtggtca 
ctaactcctg 
gaacttttgg 
agtcaccaaa 
tacctcgcct 
aagctgtata 
caaatttata 
agttacaaat 
atctacctgg 
gatgtgttac 
atgtataaga 
aatttgtggt 
ttcatggaaa 
aatttctgaa 
ttgtttcctt 
tagcaagaat 
attttatgtt 
ccaggtgact 
gtatacatat 
tatatttata 
gtaaatatat 
gagcatcctt 
tggtaaaaac 
tcttcttatt 
tttttttttt 
gaaaaattta 
tttgataaaa 
tctgtttcct 
tgggttttca 
actcagtcaa 
ctgactaaac 
acatatgcat 
acataggcat 
acacacatac 
acacacatat 
acacacatac 
acacacacat 
atacacacac 
aaatatacac 
aaatacacac 
acacacatac 
acacacatac 
tattatgtaa 
ggccaatata 
tatattttaa 
agataacttg 
ttgtttccat 
tgaaggtaat 
gaattttccc 
attgttaact 
acaataattt 



acgctaaact 

aagttgtaaa 

atgccaacgt 

agctttttac 

aattggaggg 

tttctcttca 

caatcgagtc" 

tctatgatac 

ggacaggata 

agctgccaac 

gatggtgggt 

accagatttc 

gggaatactt 

cacattgaag 

ctttgttttc 

ccacagccaa 

gaaacaaaag 

tttgccacat 

ctacatctct 

cttttctatc 

aaatcagtaa 

ttcaccgtag 

gagaatgttt 

gaatccagag 

ctgtttagca 

actatgtagg 

gctccattac 

ccattatgaa 

atatatgtgt 

cagtaaatat 

atatgtatat 

cttcctaaaa 

agttattatt 

ttttaactgg 

catgattcta 

aagtaggaca 

ataaagagta 

ttcaaagcaa 

ttttctattc 

actttcctga 

ttgtgtttat 

tcgtgtttat 

tcgtgtttat 

gcattcgtat 

gcattcgtgt 

acacatatgc 

acacacatac 

acacatacac 

acacacatac 

acacacccat 

atatgcaacc 

atttgtgttt 

ataaaattat 

catagatttc 

ttcatggaat 

agatagtaca 

agacgtttca 

tgtacccatt 

atatacagtt 

tgcttttgct 

acgcagaata 



aaactacaaa 
ttttctcatt 
acttctgagt 
gagcatgagt 
ctaccaaaat 
gtagcttttg 
atcttttccc 
cctttatgac 
tgctgctgtg 
tccagtgcgc 
ggatggggca 
acaaacagcg 
tgcacagccc 
gttctcagta 
attgctttca 
aatttcaatt 
tataataaat 
ttgctttttt 
gtgtctgtct 
cttttctatg 
taagttaata 
tttgctttgc 
atctgtgtgg 
gagttatcat 
gcaagtaaaa 
tgatattgtg 
tgatgatgct 
ggtgggtatg 
gtgtgtatat 
atacatatgt 
gatacatgat 
attttcatcc 
atgtattgaa 
cattctttga 
agttatgtat 
tgttttataa 
tattacataa 
tttttgttaa 
caactaagaa 
gaacctgcct 
acctccactc 
acctccactc 
acctccactc 
ttatacctcc 
ttatacctcc 
atttgtgttt 
gcattcgtgt 
acatacgcat 
acacatacat 
ttgtgtttat 
atgtttatac 
ttacctccac 
atatatttat 
tcctggctgc 
ttaaataaag 
taatatgttt 
ttctacaaag 
ctgcagtctg 
gctaatatag 
taatattttg 
atctaataaa 



gaagtcttga 
aaattgataa 
aataacatcc 
atattcttta 
ctaaagggat 
cttatctaaa 
agatacagga 
cattcagaca 
ataacaactg 
ccaataactt 
ctgcaaggca 
tgtgctagct 
tcaaccagtt 
gattttcttc 
gattctttat 
ttgaaaaaat 
actagtaaat 
ctccctcatt 
ttgtgtctgt 
tttactctag 
ttatccaata 
tttgttttgt 
tgacttggtt 
tagaatgttc 
catttcttct 
cacttaaagc 
ggtttaatga 
tgtgtgtata 
atgtgtgtat 
atatgtatca 
aagtgaagaa 
.agtagttttt 
aaaaagatga 
aaattacctt 
ttaaggtatt 
ttactgtgat 
actcattaaa 
aggtaggaag 
gtgctgtcca 
ccatcaaggg 
. atgaaatagg 
atgaaatagg 
atgaaatagg 
actcatgaaa 
actcatgaaa 
atacctccac 
ttatacatcc 
tcgtgtttat 
ttgtgttttt 
acctccacta 
ctccactcat 
tcatgaaata 
ataatataca 
aggtgaccat 
ttttaacatc 
aaaaagctaa 
gttcaaaata 
ttagcagctt 
aagaaaaata 
gcatcttttg 
ggatatttat 



gtgtttggat 
caatgtaagc 
cttccatgtg 
ttttgaaggc 
tctttgctgc 
atgatgaact 
atggttatta 
caggcattca 
aaccaatgac 
ccaccccttc 
ctgctttctg 
ttaccatcca 
Ctttcctaaa 
cttctgtctt 
t tatagaagg 
tccaatctac 
acctttcaat 
atctctatat 
atattacaca 
aaatcatgac 
catagattta 
ttctgatctc 
tattttggtt 
tgcattctga 
gttaagcagc 
atttgaacag 
cttggttatg 
tatatatgat 
atatatgtgt 
cacaacatta 
ttgtggcctt 
atcatctaat 
tatttctaat 
tctctttata 
ttaaatattt 
ttcttatata 
gtcttaaatt 
gagtaaaatc 
cagaaatgta 
acttcgttag 
aaatatacac 
aaatatccac 
aaatacaaac 
taggaaatat 
taggaaatac 
tgatgaaata 
acttatgaag 
atctccactc 
acctccactc 
atgatatagg 
gaaataggaa 
gaatatgtgc 
attttctaat 
gtcacttgca 
agagactttc 
tttttttctg 
taattttaac 
ttgattagat 
tcatctcctt 
gtaaatttct 
agcctatgtg 



ctgaeccaaa 
cattctctac 
gtctctgata 
tgtgttgata 
ttgtcatggt 
ttctcagtag 
atattctgcc 
tacctcactg 
ttggcttcat 
actacttggt 
ctgttctgct 
caaatgcaga 
atgtatgtga 
tcagtttggt 
gaaaggggaa 
agaaaagttt 
taaattcacc 
ctatccatct 
tacacacaca 
aatatagatc 
cttatactgt 
tatattctct 
tccgtaactg 
atttgtctca 
aagtaaaaca 
gaggcacata 
atggtgactg 
gtaagagtgt 
gtgacacacg 
tatttatacg 
gtggcaatgt 
aattggtcct 
tttgtcatat 
tatatatata 
taagcatctt 
aacatcacat 
tatacactta 
cactgtgcct 
tcattcagcc 
ctgtaaagtg 
acacatacac 
acacatacac 
acacacacat 
ccacacacat 
acacacacac 
ggaaatatag 
taggaaatac 
atgaaatagg 
atgaaatagg 
aaatacacac 
atacacacac 
atataatata 
atgggctgca 
ttgattataa 
acagagattc 
tgaatattgc 
taacaaacta 
tctcatagca 
attggattat 
ggaatgaata 
aaaataaaga 



89760 
89820 
89880 
89940 
9O00O 
90060 
90120 
90180 
90240 
90300 
90360 
90420 
90480 
90540 
90600 
90660 
90720 
90780 
90840 
90900 
90960 
91020 
91080 
91140 
91200 
91260 
91320 
91380 
91440 
91500 
91560 
91620 
91680 
91740 
9180O 
91860 
91920 
91980 
92040 
92100 
92160 
92220 
92280 
92340 
92400 
92460 
92520 
92580 
92640 
92700 
92760 
92820 
92880 
92940 
93000 
93060 
93120 
93180 
93240 
93300 
93360 
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tacgcatttt 

cgtagcacat 

cagcatttat 

ttgtccataa 

gagatgaagc 

gaaaaagaac 

cacacatatg 

caaatctgaa 

cgcttccagt 

tttttctctt 

ctttaatgag 

cttagtctga 

gtctacacag 

cagaatgtgt 

ttggatagcg 

ttcataggaa 

atgagtagat 

tctcccacac 

atgaaaacag 

tcctcaaatt 

tctctcgagg 

atcaatgtac 

gagatgaatg 

taaaaatagt 

attataattc 

tttaattgca 

atcgtctttt 

tggcatcgta 

gtcattctgg 

gtcactttgc 

aatgcaggct 

tatcctgcat 

ttcagataaa 

aaatatttta 

tttgtacgtg 

aattctaagc 

tatggtttct 

tacatgaaag 

ttattaatat 

agtgttggag 

ttggtgctgt 

ccctttcgcc 

gctggtgcca 

aaactactca 

cataatggaa 

caataaaaaa 

cagaagattc 

aagtgtgtaa 

aataaaaata 

cggttctacc 

tcccaggaac 

gacaaaagtc 

tagtttttcc 

tctgaactga 

tgggagggaa 

caagtctaga 

tcgcaacaaa 

agaagtgaga 

atgtatttta 

ctctcacatc 

tatatcatta 



agggactttt 
agggaaagca 
tagaccagtg 
agagagagtg 
gtgtcgggag 
caataggatg 
aatgagagac" 
atattcagga 
ccaaaggcag 
aaggctttca 
tctactgatt 
ggtttaacca 
aaaggtactg 
ctagagggct 
tctcctattg 
cagtcctgtt 
aatgaatggc 
ctgtgcctta 
tgacttcaac 
gagaagcaag 
tgcactaaat 
agaggaatag 
gttgcatatt 
aattctattt 
actaaaaaga 
gaatccagat 
tttttttcat 
agaaaccctt 
ctggtattac 
tgtttttgtt 
tgcatggaag 
aacgtcgtct 
actgaatttt 
acataaatgt 
gtacctcttt 
ctccag.gtgt 
taaccatttg 
gaaaatctct 
tgatatgatt 
gtggagtctg 
ccttccccac 
ttctgccatg 
tgcttgtaca 
gtcttaggta 
agtgagccat 
aagaaattat , 
ccctatctgg 
tggagcagca 
tgaagaggga 
tcgttaagct 
tcagactcaa. 
ctggacgatg 
ataggtgctt 
actgagcttg 
aattgctttt 
ttccttacaa 
acagatgagc 
atgtggtatt 
tcctattttc 
agtagaaata 
gaagagtgat 



gcagagatac 

cagatcctgg 

acttggtcaa 

gtggtacata 

tctgatgcat 

tgtatgtata 

atttaattta 

caggcaggca 

tctggaggca 

actgattaga 

tcgatgtgga 

catttctggg 

acatttgtta 

ccagtgcccc 

gctgctctta 

ttagttaaaa 

aagcaatacc 

aggtagcact 

ttctaattta 

gacgagaaat 

tcctttgtct 

ttcctttaaa 

ccccggagcc 

attttacaag 

cctgattttg 

ttcctccaaa 

caaatgcttt 

tcctattttc 

aggttcatgt 

tccactgtat 

aaattctcat 

ttaaacaata 

ctaccaccct 

atctttattg 

gtttttttgt 

cttgcttata 

cagttttgct 

gtataacatt 

tggctgttcc 

ctgagaggtg 

tctctcttgc 

agcaaaagct 

gtctgcagaa 

ctgcttggta 

gccatttgtt 

atgcagtttc 

aagagagaaa 

ttcttctgaa 

gcttttctat 

cctttcaaat 

aactcttgca 

gccttgaaat 

accagccaca 

atagagttct 

tctcccattg 

cttctgttag 

tgtccctcaa 

ttaaattctc 

ttaactctgt 

actgagctag 

aatgactttt 



aaacacttgt 

agtgaaaaac 

gttgcttagt 

atagatattt 

agtacttcta 

catatgtaca 

aggaactggrg 

ggtctgaaga 

gaattccttc 

taagtcacgc 

tctcatctga 

tgctacagat 

aaaccataga 

tgcagatttt 

ctttctgtct 

gtgccgctgg 

tgatcccata 

gtttatttaa 

gcttttctcc 

gcttcatcag 

tcaactgatt 

tccaggagtg 

aattctagtt 

tgctatgtaa 

ctgcacaaac 

ctttatctca 

tattatcatg 

tttcatgttt 

ttttgggtac 

tctgaaaagc 

gccgggtctg 

gaaagcaatg 

gaatattcac 

tctttgtcct 

ttgttggctt 

aaaacgattc 

aagaggaaag 

ctatcaaaac 

ctccaaatct 

ttcgggtcat 

tcctgctctg 

ccctgaagct 

ctgtgagcca 

gcaatgcaac 

cctttgaata 

tgactcgggt 

gtcttcctag 

acacattttt 

ttaaggtgtg 

gctgagatgc 

ccagaaacaa 

caattccttc 

ttcttggagg 

gtgctcatat 

cagtaaaaaa 

tttcctgcac 

aattgttgac 

aagactgact 

atgtatttga 

agtttgcttg 

attttggtga 



tctgtaaaat 

ctacagcttg 

ttctctgaac 

tgacaggctg 

ttaggcaagg 

cacacacaca 

tcatgtgatc 

caagggaggt 

ttcctcaggg 

acattataaa 

aaaatacctt 

aaaattagct 

attctattta 

ctatttgtgc 

ttccattgtt 

ctcccagggt 

cagaagaggg 

cagacttgca 

caattcatgc 

ttaccttcat 

ctctcctctc 

catgcaacta 

gagctcaaat 

acagagaggt 

ataactaaat 

ttccagtcag 

tttgaagctt 

ttcttgacaa 

attcatttat 

acccacagaa 

ctctgtctcc 

gctaaggata 

tccatattta 

ggtcttccaa 

ctactatttt 

tggattttct 

gagaatctat 

tgtaataaag 

catgttgaga 

tgggatggat 

gctgtatgac 

tcccgagaag 

attaaacctc 

aacagcctaa 

tatttggaca 

tccttgaact 

gaaaatgcag 

gcttggaagc 

agaagagggt 

catcgacccc 

ctagatggag 

atgtctttag 

cgagctggct 

tcccagcaca 

aaaaaaaaaa 

tccatggata 

ttgtgcctta 

tcgtgtatcc 

ataagggcga 

ttgacaatgt 

aataataaaa 



atttgttaaa 

aattccactc 

atcaatttac 

ttaaaagcta 

tttttcagag 

cacacacaca 

atggggctgg 

tttaacactg 

aacctcagtc 

ggacaacctg 

cgcagaaaca 

atcacagaac 

catgaccaac 

tggcagtttc 

tctttgaggt 

cccaaagcca 

aaaacccttc 

aaaatctacc 

tcaaaccaat 

tacagtcgta 

ccaacaggag 

agcaacttta 

gatatattct 

aagtaccagc 

ggctacaccc 

gaaaacagct 

ttgattaagt 

gcctgagtcc 

catggagtga 

acatgacaag 

ccagtcagtc 

aaagaatggg 

atctgaagcc 

tagggragtt 

cttaatttta 

acttttaaaa 

aaagtatgtt 

tgaaaattaa 

tgtgatcccc 

ccctcatggc 

gtgcctgctc 

ctgagcagat 

tttcctttat 

tacaagtata 

attaaaccaa 

taagtggtag 

gagctgtttc 

ctaacatata 

gtcgggtgcc 

tgattctgtc 

aaaactacct 

agggtctctg 

aaagtaaaat 

gtctccatgt 

agtcaatgat 

atcaaattcc 

ttaggttggt 

atagagaata 

tttaaaagtt 

ataggcacaa 

aaggaatcat 



93420 
93480 
93540 
93600 
93660 
93720 
93780 
93840 
93900 
93960 
94020 
94080 
94140 
94200 
94260 
94320 
94380 
94440 
94500 
94560 
94620 
94680 
94740 
94800 
948 60 
94920 
94980 
95040 
95100 
95160 
95220 
95280 
95340 
95400 
95460 
95520 
95580 
95640 
95700 
95760 
95820 
95880 
95940 
96000 
96060 
96120 
96180 
96240 
96300 
96360 
96420 
96480 
96540 
96600 
96660 
96720 
96780 
96840 
96900 
96960 
97020 
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aaaaatagag 

cactggaaat 

atattgagtg 

taaaagaaat 

tttacttgca 

taaccctgat 

ataaacactt 

gatgatttca 

gttatgcaga 

aagaaaaaaa 

ttccaatgta 

tatttcttac 

tatatacctt 

agctctttta 

agaactcacg 

aaacagaatt 

cctacatttc 

tgagaatagt 

taatacattt 

aataaaattc 

ctttcttcat 

tttttgctta 

agttgccaat 

cttaactagc 

cactgaggag 

taacataaat 

aaaaaaatgc 

aaggcattac 

gagttatttc 

tctttctttt 

taacaaaaac 

cccatgtttt 

aaaaaaaaaa 

gtaatgccta 

gtcagatcct 

ctatgatgta 

atataacgta 

attatcatca 

aagaattcta 

atctggaaat 

tcttgaacag 

atatctactg 

aataaacatg 

gcatttattt 

tattcttact 

aaaatggaaa 

attttataac 

tcaaatccac 

ggctttctga 

tcttccccct 

tgaattatga 

gccaaaactg 

tgaattttta 

tcaaaagttg 

ctttcaggta 

cttgggacgt 

atcactaacc 

cataggcttc 

tcaccaagca 

tttacagata 

ggtaagtata 



ctgtctaata 

aaggaatgag 

ggaacacaga 

acagctggat 

gccaaagttt 

tactgcagtc 

tagaactgtcf 

tataaaattg 

cacagaataa 

acaatttgga 

gttcatgttc 

ggacttaagt 

ctctcttgat 

ggtcacctga 

cggcttccct 

attgttttga 

tttctaattt 

catcagatta 

ttttaactgg 

aaaaaagcta 

tatatcattc 

atataaaatt 

ttaaacatag 

acacatggcc 

ctctaagagt 

ctgagataaa 

aaatgtcctc 

ttggggcttg 

ctttctttct 

tctttctttc 

ctttaacata 

tccttgtggt 

aaatgtatat 

actatttctc 

ggcttatagc 

gaaatgataa 

tcatgcataa 

ttttattata 

tttaagaaca 

aaagtcaaca 

atattaattg 

ggattatcta 

ctcccgtttc 

ggttcagcta 

ttctctccta 

attataataa 

acagaccaag 

cattttagtg 

gctggcttcc 

acacattgct 

cagagaagga 

aacaaagaaa 

tatagagtga 

gagaatataa 

tgtttgaatc 

tcagctaccc 

tgagtgtgac 

a get tat tea 

taggcagtga 

ttgaaatttg 

taaactcaaa 



atgaaccatg 
ccagtggcca 
caaaataacc 
tgcctatcaa 
tgaaatctta 
agcaaattaa 
atctgtatta 
aaataatttt 
aaacaagtga 
aaaaaatctc 
agtatattag 
atgtttaagt 
tactctccag 
aaacttctta 
tgaaacgaat 
atgaatcatc 
tattacatta 
ttttaaaaga 
aattataatt 
aacattaact 
tctctatatt 
ttaggcaatt 
ggaagcaaaa 
agtgccactc 
tgattttgaa 
ttcttgactg 
agcataataa 
caaattcatg 
ttcctttctt 
tttccttctt 
taaacagatt 
agtctgctcc 
atttggcggt 
tcatttgaat 
tggatactat 
atcccacagc 
ggectcttat 
gecaataaat 
tacatataga 
caaaactttt 
tccacattcc 
catggtgcca 
tatggtgttt 
tgtaagggat 
attacagtaa 
aaaataagaa 
aaaatattcg 
ctctatattc 
ttggagtgca 
tgtaaatcag 
gtagatgaat 
ctggtcttgc 
cttgtcagat 
tagaagatta 
aaaagcaaac 
tcaaaccatg 
agagacaagt 
aatcagtttt 
tttatctgtg 
tagacacagt 
caagggacta 



tgtttaaaaa 
tggatcacag 
cccaaagtct 
aaagatccag 
gtagagcgtt 
actcaagcct 
agcagctgta 
gaatattcct 
accaaaaaaa 
gatateggea 
cataatattg 
aagagcatat 
ttttggggaa 
ataaaetaag 
acatcactgg 
agagaattct 
aatttaaaga 
ttaggctaaa 
agtcataagg 
gctgaagttg 
tttgaatgea 
agatgeaaaa 
cagcccagga 
agtacacaga 
acgtagtgga 
tccgtactca 
tctaggataa 
actgacattt 
ccctccttcc 
tttttatttc 
gataaattac 
aaccatacca 
gtgatatttt 
ctgagttgca 
agagcatgtg 
agaatgttca 
aaaaaacaca 
tgecagggat 
taaaatatgc 
ctatcctgct 
ctactttctc 
gacacccgtg 
atcatataat 
ggtaaaggga 
atgttcagag 
agttatccaa 
catagaattt 
tctttctgtt 
gacattagta 
ttattattta 
ttaggtagaa 
catggggtta 
caggaggata 
tacccaaatt 
cagggatttg 
aattttgett 
caatcaatct 
caaacatgat 
aatgggaata 
ctacaaaaat 
tgtattaggt 



agaaaagtta 
atataattgt 
ttgetaaate 
ggcagtggat 
ttgctttaaa 
gtattatgaa 
aaaaagcagt 
catggtgatt 
aatgttattt 
ctaatttaac 
attgtattag 
ccattttgat 
atcactttgg 
tggtacacta 
gaattctgag 
agtctaaact 
tttttttcaa 
atatgeaaaa 
ggaaagtcat 
ctttttgccc 
tatatttcca 
gtgtagtaaa 
gttaaagttc 
cacacaccca 
aattaccttt 
aaatattttt 
gtagttaatg 
gatgatatcc 
ttccttcctt 
taaaggaatg 
tgagcaaacc 
geaaggecta 
cttaaacaaa 
atttagacac 
ctgtgttcct 
agtgaaaaag 
gattccaaaa 
tttaaaaact 
attcaagatt 
gctggtattc 
tctcaagttt 
aaggataggg 
ggaggacatt 
atatgttcat 
gtatgggaat 
gtccacagta 
ggtatgcttt 
tttaacatat 
atacagacaa 
tttatatgtt 
agataggaag 
gtctccaact 
gaaattaaag 
aagcagaaat 
caacacaaga 
aagggtctca 
ctggaataag 
gggcagcata 
aacagaaata 
actatgtttt 
tatctaaaat 



actttctcat 
aaagattcct 
tatggtcata 
actaactttc 
tttaatttat 
atattatgaa 
gtcaagcett 
tgtatgtttt 
aaaaatactt 
tttgtaaatt 
aggggctcag 
taatgaggtc 
tagctgeatt 
gtaaacaaaa 
aaaccaatta 
cttctaaaaa 
actttttgaa 
ttcccttcag 
tttagtggga 
tttgeaaatt 
aagaaattac 
gcaacactaa 
tgacagctat 
ttttcacagt 
tttgggaaaa 
ttccaaaatg 
atgetacatt 
caggttagaa 
tctttctctt 
tgtaaattaa 
aaactgtcca 
gaaagagcaa 
gatttttatt 
aaatcctgag 
gactcctcat 
aaataggaaa 
tgacatggtg 
atcattctct 
ttgtaacagg 
aattttctat 
acttaaaaaa 
tagtggagaa 
cttggcttgt 
tacagaaaca 
ggtcaaacag 
aaactataca 
ccttgcccca 
gtaaeggegg 
cttctattcc 
tttaattttt 
gaattgctga 
cagaataaat 
ctagaagccc 
tcaggatctc 
tacacattgc 
ggttggtaat 
gcaatttcat 
tactacaggt 
acccattaag 
tagggaaaat 
atttaaaaca 



97080 
97140 
97200 
97260 
97320 
97380 
97440 
97500 
97560 
97620 
97680 
97740 
9780O 
97860 
97920 
97980 
98040 
9810O 
98160 
98220 
98280 
98340 
98400 
98460 
98520 
98580 
98640 
98700 
98760 
98820 
. 98880 
98940 
99000 
99060 ■ 
99120 
99180 
99240 
99300 
99360 
99420 
99480 
99540 
99600 
99660 
99720 
99780 
99840 
99900 
99960 
100020 
100080 
100140 
100200 
100260 
100320 
100380 
100440 
100500 
100560 
100620 
100680 
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gatcaaagga aacttctata aataaaatct aaaaacctga aaccataaag cctatgggtg 100740 
aaattaacag tacattagaa ggagatgaag atagaatgag cgagttagaa gaaactcaaa 100800 
gaaattatac agaataaaac agaggaatta agagacaagg aggacagaat gaatggttag 100860 
gctttatatt gaatcagagt ccttgaaaaa aaagaaaaaa ggatgggttg aaataatgtt 100920 
tgaagaaata gtgcctgaga atattccaga gctgaggaaa gacacctgca ctcagattca 100980 
atgattggaa caatttttca gtagtataaa tacaaagaaa tgttggtgat tgtaaaactg 101040 
caaaacacca aaatcaactg'aaaattttta atgtaacctt gaaaagaaag agaacataag 101100 
aaattagcat gataggagaa atatttaaat cagtaattga tttctcatca acagcaacgt 101160 
aagatgaaag agaatgaaat aacttcacaa gctgaaacat aataactgaa tctaatgctt 101220 
gaatttcttt cacaattata ttgtctaata atccaacaat cacattagaa attaaatagg 101280 
agcaagaata cttactgcag ccttatttgt attgctaaaa gcatagaaat agcatccagg 101340 
tcaataagta ggcatttcga taataattat gagatagcag ttcattgaaa tgttgggcat 101400 
ccattagaaa ggaaaaggca attcttaaaa atccacgtgg agttgtattc cagatgtaat 1014 60 
ttttgcaagt gcatagccat ttgcatacag aaaatgtatg tgtatgtgcc gacacatgta 101520 
tagattttgt ggtaatctct ggagagatac tcaagaaaat aattgtcgtt gtggagagaa 101580 
atgaggtgct gaagaaaagc agccaaggtt ggggggattc atactttttc taccccttgt 10164 0 
accttttaaa tcttgtttcg catgtactat tattcatgaa ttacatatat tttttaaggt 101700 
tggtttgatt ttagttttta attaaaaaca actaaaatac aaatatctcc acagttatca 101760 
acagcacaga ttaaggaata gtgttgtgaa gtgtcttcat tttaggggtg attgtgttaa 101820 
agttctgggg acttaaatga cctttttcat accatccaaa taccgattga gaatgtaaat 101880 
ccaaatctct ctacaaaatc catcctttta tttttactgt ttaatacctc cacatatttt 101940 
ttaaacagtt ctatacattt ggtactatat aagttgtatt gccatcaaat gaatattcat 102000 
gttcactcca acattcatac gtgtttttca acctatgggg tcctgggtag atgattaggt 102060 
catgagggca gagtcctcat gaatgaggtg agtgccctca tataggagac cccagaacac 102120 
tcccttgctg cgtccttcca ccatgtgaag gcccagcaag aggaaggcca tctatgaacc 102180 
aggaagtgat ccctctcctc tccagacacc aagtgaccct gcaccttgat cttgggcctc 102240 
gtaacttcca gaatcttgaa tgtatatctg tttataagcc aaccagtctc tggtattcca 102300 
ttatagcggc ccatacatac caaaagctgt ggcactaaca aaatccagca aagaagagtt 102360 
ggatctaatt gtttcattaa aatttgatta tgaatctttg ctacggccta aacagcagag 102420 
accagagttc tctccttcca ctgtgcgage ttacaaattg gccgctaaga gacaacaaat 1024 80 
ctaccagcac cttgatcttg gaattcccag cctccagagc tgtaagcaat aaatttctgt 102540 
tgtttatcag ccacccaatt tatgcttttt cattagaatg gcccaaaaca agtaagaaaa 102600 
tcccccaaaa caaatcaagt gactggtaca tgcctatagg ctcagtaata tttttttgaa 102660 
tgaataagtg aataaaggta gagtgaatgt gtattgcctt tgaatgtaaa cctttaatac 102720 
caattgcagt atgatactga attcacttaa gaattatttt tacatagttg tgtagaagca 102780 
atttataata tggaaatatt tctcaaattt acagaaattt gataaatatt atatttattt 102840 
tatgtaaatt cccaaactaa aaagacatgt cttttctctc actgtccatg taagttagtg 102900 
gaagacatcg atctcaaatt acaatattta actgtattaa attgtatata tagatgcaga 102960 
ctctaggata aaagattaaa tgtttgatat ttttaagcca gatcctatta ttaaaatcat 103020 
caatgatttt tactgtgaag gtgtagccac atgacattac cttcagaaaa cttcaaagca 103080 
gcaaaagcat gtcgcagagc caaatgcagt tcccagtgcc aagaaaatag ctaatataca 103140 
tgggcatggt tccaaaaaaa cccatatggc aatgtgagtt gaacatcttt tacagcagca 103200 
attccattct aacccaagtt ctttccctta aatagcacct cttttatttc ctgtagtcaa 103260 
actggctgta cagtcctcac tcttgccaaa gtccttgtct gggatgtgca tctccaggaa 103320 
ccgaagtgct aaacctttct ttttactagc ctgctctgcc aaaatccatc ttgacacacc 103380 
ttaaattata ttagatcaaa ggctgaaaat tatatgccat caaaagcaga aagataaaat 1034 40 
gtttatctac cgcattaggg atgtgcaatg tttttaaagc acttgatgtt tgagtgagac 103500 
attattcgac gtctctgcag tgcagcatgg cttacttcct tggagtagtt gatagatata 1035-60 
ttgtaacagg ctggtggcaa atatgaagtg gcatattcag tatgtgttta ggatgtttat 103 620 
gtgttctctg attcttgtgg tgctctgtct aattggaaat aaggtttacg tggtgagagc 103680 
agtgaagcaa cgacagggca atactataaa agcgactcct gccaccagca ctccatttct 103740 
tttaatggat ttctacatat ttggaaactt ttctaatttc tccttaaaaa ataaacccat 103800 
aaatgtcaag aagaatgttt ttcttcctaa tggagtttgc ttgccaagct ggctttgttg 103860 
caatgtctgt aggtgggcta cggtaaagac cagtggaatt gaagtaattt ttaaaggaat 103920 
attgtctcaa atgaaatata aacaaggaac tgtgtaccca ggcttcaaag tgataaacag 103980 
atgatgctat aatcaagtaa tttttaataa atatttgatg gaatttactg aagtccatat 104 040 
ttcagtaatc catcatccac tcccagatat acattgagat tacaaaatta acaatttgtg 104100 
ctgttgccag aattaaaatt tcctagtaaa gttgaactat tatataacct tagtattttt 104160 
gcatccttaa ttttttccag attattttgc aggctataaa catgttaacc tttaggacta 104220 
ttttatgtcc aggtactact taaaaatgta ttttagatac aacaatcata caatattttc 104280 
taccgttatg gtttcacagt agataactaa aattatatat atatatatat tttttgagac 104340 
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aaattctcat gctgtcaccc aggctggagt 
tccacctccc tggttcaagc gattctcctt 
aggattacag gcacctgcca ccatgcctgg 
tttcgccatg ttggccaggc tttaacataa 
ttctgcatac aaatttcaga aaaattcact 
ggtagagaaa agtaaaacat tgaaaaggta 
cattcatgaa tttacctaca* atttcttgga 
aagcacatat ttaatggaga aatttgtttt 
atttagaaaa ttatgacaaa aactcgttca 
actagtaaat gttaagtcct gcattatgca 
ctcaggcttg tttaagacaa atagcactga 
ttggaagaaa tgaggcccat atataattga 
cacgtgttta tccagcccct gcaatatagt 
accattatga agattcttac agtcatggga 
gcactgtgtg atgaatgctc tggcctgtgt 
cctgcaaaag gtagctagta aaagcttcag 
acattctaaa aaaagaaata gtttgaccaa 
ttttgtttgc ttgtttgttt gtttgacaga 
tcttgaactc ctgagcaatc tgcctgtctt 
gagccaccat gcatggtctt gaatttgaca 
tagtgcaatg gtttgcacat tggaccgctc 
gttggaggtg gggcctagtg ggaggtgttt 
cttggtccca ccctggcctt aatgagtgaa 
gttgttaaaa agagcctggc acctcacccc 
cttcccttcg ccttccacca tcattgtagg 
tggtatgctt cttgtacagc ctgcagaact 
attacccagg ctcatgtatt tctttattga 
ttgaagtgta agttttatgg aagagattgc 
aaagactttg aacgttttcc aaatgtatgc 
.aggaagaaca ggtgcagcca aaattaattg 
aggataattc tgctattaaa agagtagatt 
tcagtggtea ttgtgaaaga ggacttggtt 
catgggcagg attttgtggt caacttgaca 
ggaacaatct gggcagtgat gtcagtcgaa 
caaatggaat ccaaataaag taaaattctc 
tttaaaggaa actaagattt ttaaatgcca 
agtcttccct aaaagtgact taagcaagaa 
acagggtatt tggcttaatt taaatgatct 
aattttcagt tgtttattta tttttggacc 
tcagggaaaa ggaaagtcca gacacagaaa 
tcaggggggc acaatgatgg aagtgcagtt 
agagccaaga gggtgagtct ctcaaagcca 
gagagaaatg ctggcaagag aactgggcca 
cccatgagga aagtcaggaa ggagttactg 
cctgctgcta cgtgggagct ggtgtcttcc 
agcaagatac tgcatcagtg cattcccatt 
cttgccaagg gtgtgatcca aaactgagtg 
tccctacata gtttgcctcc tgaaaacctc 
gagccattca tccaggtaaa ttggagctgg 
rgggggcttc atcatgggtc ctggcctgag 
cttctttttg atgcctagcc atatttctga 
tttcctacta ggctaggtct tacttcgcag 
tctcctttga attcttccca agttctctcc 
agagatgcta tgtctatgat gccaacccaa 
cgctagactt tccaaactct aaaacatcga 
gttacttaaa ttaaagagta attgatttgc 
actgtataat cccagtgtag ttataattct 
ctgtcctctg attttatgaa ttctaagatg 
ccagtttact tgagtccaag taggctgatc 
gctgttgagg aggattttaa gccctccact 
caacctaagt atttcagagc tgctagtgaa 



gcagtggtat gatctcggct cactgcaacc 104400 
gattctcctg cctcagcctc cagactagtt 1044 60 
ctaactttcg tatttttaat aaagacaggg 104520 
ttgattatgt ttcaaaaaat aagaagcaat 104 580 
tggaaataag ttttattagc ttacactgtg 104 64 0 
actgtttgct ttaacattta cctgtaaaaa 104700 
agtagaaata tattctcaat gttgttttag 104760 
ttatagtgtt atttttaaac ataattttta 104820 
ttcccttttt tggaaaaacg atacttaacc 104880 
caggggttaa taaaatttag gatgacatgg 104940 
taggctgttc cataaaactt gttatattgc 105000 
atatatcaaa gttagtaaaa agacaaccac 105060 
gcattctgtg ctacactgtg aagatcgcta 105120 
tgacaagact tctgtaatgc aatgattaca 105180 
ggataaaatg ctttagaaac atgagagata 105240 
agaagatagg acttctgaat ttgggtctta 105300 
agacatggaa atttatttga atttgacata 105360 
gatagggttt ctccatgttg ctcaggttag 105420 
ggcctcccaa agtgctggga ttataggtgt 105480 
tattttgata tttgggtaag gggagaagat .10554 0 
caaacctcat gctaaaattt gatccccagt 105600 
gggtcatgag ggtggattct tcatgaatga 105660 
ttctcacaat gtcagttctt ggaagtgatg 105720 
tctcrtcttgc attctctctg cacacatcag 105780 
tttcctgaga tcctcaccag aggcagatgc 10584 0 
gtgagccaaa taaacctctt tcctttgtaa 105900 
aacataaatg gactaaaaca tttgatgtgg 105960 
caaggatgtg tccaaatggg gtggtgggta 106020 
caatacatct agactttact tttgagctag 106080 
gaaaagtgat agattgcatt ttgtttgaga 10614 0 
gcaatagaga gagtcaagaa gacaccagtt 106200 
tcatggagct gggggaaaga ggtcagagac 106260 
tcgggatgat gtacaaagaa aaggcaaaga 10 6320 
atttgggcag atcgttttga tagaatggat 10 6380 
aattatgaga attcaattaa ataattttgc 1064 40 
aagataaaag ctgaaagtca ttataacttt 106500 
aaaaaaacag tatatttttg aaactagtta 106560 
ctccagcaga tattcctttt gtggtccata 106620 
atacagtaaa gggttggaca gagataaaaa 106680 
ccgctgagtg ggcagaagca ctgtccgaga 106740 
aaggttgttt ccacgctgtc ctagatgtac 106800 
cagttgagaa ggaggtcact tgagtagcag 106860 
caaacacagt ccaaagggag atcacggaag 106920 
caactcaata gctgctatca taaaaacaac 106980 
tgtaaaatat actccaccta attatcctac 107040 
gccttgtaag tccgccttgc cttgtcttgc 107100 
ggctgatatg aattgggtgg tgctttcaat 107160 
ctttggtatt tgggccacgg taatgcaaag 107220 
aggtaggggc tgagcctctt ctggggcctc 107280 
tgcaaagata gcatctagta tcacagataa 107340 
ttttcttgtt tattaagagt gagaactaca 107400 
t'tcttaaaat gtgttaaacc tatttgtggt 1074 60 
acgcctggga agttgtggag ctggtcctct 107520 
ttacaaaaca caagcaaaaa caaacagcta 107580 
atctcctctt tgaccctatc atgaaagtga 107640 
tgttaaatac attatttgcc aaataatttg 107700 
gtgtggctaa ccctgtgttg ctctggaatt 1077 60 
ctgatttaat ttaatttttt aaaatatctt 107820 
cacagaatca tattcaaaga attccaatga 107880 
agcttcctgg catcagttgg ctttgagaac 107940 
atgaatcttt cttacagata ttacagaggg 108000 
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-ttacagaaag ttcagtttct gactcaccct gattttgaga aactgtttct tagggtcatt 108060 
ccctaaggtt ggaatcgctc tctctctctc tg.ttggttag tgccctgtaa acactgacaa 108120 
ttttctttct cttcagttct tcattcttgg tctagtgctt cagtttccaa gtctacatga 108180 
ctatgctaca gtgtaaggtt atttagacgt acaatttcat gatttctttc ttctgggatt 10824 0 
atttgcttga cttgtgatta gcttgcctga tcacagacaa tagtaacttt ttaaccattc 108300 
ttacatgatt tcccttttcc ttaaatttgt tgtttctgta ttagaagcag gaaaaaaggg 108360 
ccaacgttta gaatttgaaa" agcagtgttt gaatgcactt gttacctggt aaaccagtca 108420 
tttactgcta tcttacaaat gaccacagca catggtagac ttaaagaaca attatttgca 108480 
catttctcaa ttttgtggat cagaaactgg cagcactcag ctgagagact cttctgtttt 108540 
acggggattc acagcgttca ctcagagaca gatagctggt gggcaggccg gtgtcaggga 108 600 
ccaaggatgg catcactcgt gtgtctggcc cttgacaggg atgctggagg tctgggctca 108 660 
cctgggaatg gagatggcaa cacctacatg tgacctttcc agcatggcag tctcagagtg 108720 
gatatggcaa cagctgcaca tgacctctcc agcatggcag tctcagagtg gatatggcaa 108 780 
cagctgcaca tgacctctcc ggcatggcag tctcagarvtg gagatggcaa cacctacatg 108 840 
tgacctctcc agcatggcag tctcagagtg gatatggcaa cacctacaca cgacctctcc 108 900 
ggcatggcag tctcagagtg gatatggcaa cagctgcaca tgacctctcc ggcatggcag 108 960 
tctcagagtg gatatggcaa cagctacaca tgacctcacc agcatggcag tctcaggtta 109020 
ctggtacttc ttaggtctca gctgaggact ccagtgcagg tgttgcaaga cagcagacgt 109080 
agcactgctt atgtgtatcc tgaagtccca tgtgtatcct gaagtcccag aaagctgctt 10914 0 
caagtagatt . ccagtggaga agggagtcac caaggctgga cagtcattta tccatggcag. 109200 
gactagcaaa aaaatctgca ccatctctca tcgcacacac gtgggtaagg tttggactgg 109260 
tatgatcttt ttccataatc tcattttcgt gatttcttaa ctgagaatat aatagtaaat 109320 
accaatctca cagaattgtt atgaggaatg cagaaactta atcaaattac ttaattgtta 109380 
ttttcatttc ttattaattt gttcccattg aaattaatgt tatgtctggt ctcattttaa 109440 
attatttatc tttctgaatt tggatcccat cctcaagttt tcaatgcttg actttttatg 109500 
aattcttgta ttatacctca aatttaataa aatcctttta atcttgtctt caccctagca 109560 
ctgtctatac tgtcccctgg gtggaagttt cccatcattc taattacttt tatacacaaa 109620 
tttgacttct ctctccagcc cctgtgagaa acatgtcctt tctgttacac agggagtagg 109680 
tgaaggcctt tggacttcct atttttccta tatgaaactc tcttctttca gatatcagaa 10974 0 
ttttcctctc tctctttcat tcagaactct gctagacaat catcttacca ggaagatttt 109800 
tattcatcac cctgtgctaa ataacattgg tactatgctg ggctgagcta ctatttttcc 109860 
ttgaatcctg ctgtaggctt ttcttcagac acttaccatc accaggcgta atgtgaatgt 109920 
gtttgttcat tgcatgtctt ctccacgtga atggaaactt catgacaaga atgtgttcca 109980 
gggctgggcg ccgcggctca cgtctggaat cccagcacat tgggaggccg aggcgggcgg 11004 0 
atcacaaggt caggagctca agaccagcct gaccaatgta gtgaaacccc gtctctacta 110100 
aaaatacaaa aattagcctg gcttctgtaa tcccagctac tcggaggctg aggcaggaga 110160 
attgcttgaa ccctagaggt ggaggttgca gtgagccgag atcactcaag aaaaaaaaaa 110220 
agaaaagaat gttttccttc acctctgaat ccctagtgta taggggagag cctggaacag 110280 
agtaggggtt ccataagcca atcaaataaa tgactgaaaa agagttaagg gacaaggtca 11034 0 
aacaaaaaga gctaatttaa ctgctaaacg attattctat ttaggccaat tggtaaccca 1104 00 
tgactgcttg taaaattcaa tttcattgaa tgggacatat ttcaaacata aacataggtt 1104 60 
tttttaatga atataactga actgcagatt gcaatcaaga tgttccctcc ctttctcatt 110520 
tagatttctc atcttattag ccatcactga ccctcctcta gtggttaaat gcttagtttg 110580 
ggacattatc tcagccatac cctcagaatg gttgattcag tggttcgttg cccatgtgtg 110 64 0 
ctgtgatgtc aacaggctgt tgaatgctga gggcaatctc atatttggct aatgatggtt 110700. 
cttcagaaag caacaccctt cagattagtt cagcagggtt caactttgtg gtgaccacag 110760 
ttgggctgag caggctgcat gtcaggcaga gctgtcagtc tgctccagcc aagagtggag 110820 
tgggcacttc ttgggatttg gataaggaaa catttgtctg agcagt.ttta ccaccttaat 110880 
atgggtttga ttcttctatt tgtatacaga gtaagccaaa ttaaaaaatg aaaagaaaaa 11094 0 
agaaccagtt catttaatca agacatgcta gtggttagtc atcttttata acaaccacat' 111000 
cgataattca cctttttcat tttcaaatta 'gtttttggtt gttttgtaga cagagagcct 111060 
tagccataaa ttaaaagtaa tgataattag ccacatgttc ttcattatgt tgagggcttt 111120 
gaacatttct tcacatagct attagacatg tcagccacat gtaggaaaag gaactataga 111180 
atttcgtaat tttttttttt tttggaaaaa tgagcaacag aattttaagg cagaaacgtg 11124 0 
aaatgcgagt gatggcaagt cccagtgaag tggagaatag catgctttat gtgaatgtct 111300 
gatatttgtt ttctcagtgc ttttgatctc agaacaccta acaggactgg ggcagaagct 111360 
ttttgctgac ttgttgcatc agtgctaaag atgaaaggaa acagtcttca tattcgccgt 111420 
ggtgttgatt gccctataaa taggaaacta cccagttaac cttgcctgtt gtcatacgga 111480 
gagttttatt aaaaaataaa cagtgcgtgg cgggtagcag aatgtgggat ttctgcttag 111540 
atgcccgcaa ttctttcata tacttcatta aaatgtgcag gagaacttca gagccatgaa 1116O0 
taaaaatatt gttatggaaa ctgaggcctg ttatcatagc ttcagtgatt tgcaatgatt 111660 
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atttaatgaa tgagtttatg tttgaaaata 
cccctacccc tccgatttct tttattttga 
actcagagga gatgctcttt ttatcaaaaa 
aatcttacca gacttcccga acataaagaa 
taataagtaa tcttaggtgt gtgcatgctt 
ttgtatttga tcttaccagg atttgatgta 
aaagaacaga gaggacatcC ataaagccat 
gaaactgatg actatgccag gcatacagca 
accaagctca ccaaaacaac ttaccattcc 
cctcagaggg tgcagagatg aatttagaag 
tcaggcctgc ttgcaaatga gtggctctgg 
tttccgggag gaaaatgaga gcactgttca 
gtttcttgag agtctctgaa gttagttctt 
aagttagtaa cttttgcata cttcagtttt 
tctaccaccc tcacagacac acccaggatt 
tgatactcag tattaactat cacatggtcc 
ataaaaaagc tttctctttg accccgaaag 
tatttatatg gatttctaaa atattgtgag 
atctctgttg ccgtcagggt tggtttctca 
ctttctcacc gtgtcccaga atggaccttc 
ggacaccagt cctatcagat tagagtccct 
gccttaaagg ccctatctcc aattgccttt 
agggacacaa ttaactcaat aacagtgaca 
atagtttttt tctttacttt cacatagata 
tcattttaat tgaaaccaga attatattgg 
catttactct tatattaatg tattagcaga 
tcctagaaat tgactaatat ttttattttt 
tatttttgtt gttatctaat agtattcaat 
aaacatacat aagaatacaa atgtatcctg 
tgcaattgtt tctgttttgg caaattggaa 
totaggaaga atatttgctg ttgggtcagg 
tgatagacca ctgagacatt taaaagatgt 
atcaaaatcc cacaagggaa gcctgacctg 
aggtggtatt gaagccatgg atgaaattgc 
gaggacaact t.acccatcgg gcacattaac 
aatccatata aatattttaa tatattttaa 
aagagaaagc ttttttattc tcaaatcaaa 
agttaatact gagtgtcaac ttgattggat 
gtctgtgagg gtgtagccaa aggagattga 
ccacccttaa tctggctggg caccatctaa 
aggcagaaca acgtgaaaag cctagactgg 
ctgaaggctt cctgccctca aacatcggac 
ggctctcctt tctcctcaac ttgcagatgc 
tcatacttaa atacttaata aactcctata 
tatatatata tatatatcta ccatattcgt 
aataaaaatc tattaaattt ttgtctgaaa 
atgttcaatg ttgatatcaa tatgatgcca 
ccacaaaagt tataatgaga ccctgatttg 
gtttgagaaa aataaatacc tacacaagtg 
actttctcta tgtctttccc ataccccaaa 
tcagatctca gaaggcgaca tgaaggagat 
tgtacctcaa ttacatttat aaaettcttt 
ttattaactt taatgtttaa taaaccagtt 
tgttaaagtc tctttaaagc acacacctct 
ataaactctt gaaaaaggct gcacatatct 
attataccct ctcttaataa aagaattatt 
ataatacagt aaaaggtcaa aataaaactt 
gaaataaaaa aaaattacaa catggactat 
aaatgtgtga cgaatgatta ccaacatttt 
ggcactggtt ttctttgcac ttttagagtg 
gggaaattac catctcaatt cactttcttt 



tcccctgtga taatatgaat ttatcgctcc 111720 
cacatatgga agatgctgtt ttgttttgac 111780 
tcactgaaca gtaatgagat tttagtgaac 111840 
atctgatact gaggagtgca cagtgttcta 111900 
ttcagtggca gtattcattt ttcacgagtt 111960 
agaagacttg ggtagctctt gtgtagacac 112020 
agagacagga tcccaaccac agtgtgagga 112080 
ccatctcttc agcctaagag cagggggttt 112140 
tctttccgcg tgattggaac tcatcttcct 112200 
gaagtttgaa gagcccatgg tgtccagaac 1122 60 
gactgtcttt ccacatctcc aggagcctat 112320 
gaagtactgg aaggaacctg ccaacattga 112380 
ctaattttta gtaccctgac aatttcatgc 112440 
ctcatctgta aaattggggt aatgataata 112500 
aatactttgc atccttcaat ccaatcaagt 112560 
ctgaaaattt tattttattc tgatttggga 112620 
tttttctctt tttgttaaaa tatattaaaa 112680 
ccccaggaac tctggttaat gtgcctgatg 1127 4 0 
ttagggcact tctgggcttg tagagggcca 112800 
caagatctct ggtctccttt cctcttaaaa 1128 60 
ctcttatgac ttcatttagc cttaattatc 112920 
ggggctagga aatcaacata tgaatcttag 112980 
ttttaaaaac tacaaaataa gggaaatatc 113040 
ttacatcata ttaaaagata gatcatttct 113100 
aataaggggt gaccaatatg ggtttttcaa 113160 
agacaagagt tcgatatttc ttaataattt 113220 
actttctcag atatattcgt tttaataaac 113280 
ttagacaatt tggagaacat ggatgcctat 11334 0 
ctttcagcat cagagataat ttagggttat 1134 00 
ggattatggt tgatgaactg aggtaagtat 1134 60 
catgctttca gtttttatgt atgtagagtt 113520 
tcagaaggta ttgaactttt ggaaaggcag 113580 
tgtacagaca tttgtgaatt gtctgcatat 113640 
taactaaagc atactcttgt gagaaaagaa 113700 
cacagttcct ggggctcaca atattttaga 113760 
caaaaagaaa aagaaaaact ttcagggtca 113820 
ataaaattaa attttcagaa accatgtgat 113880 
tgaaggatgc aaagtattga tcctgactgt 113940 
cattgagtca gtggactcag aaaggcagac 114000 
tcagctgtca gcatggccag gatattaagc 114060 
cctagcctcc cagcctacat ctctctcatg 114120 
tccaagttca tcatctttgg gacttggact 114180 
cctattgtgg gaccttggga tcgtgtgagt 114240 
tacatatata tatatatata tatatacata 114300 
tctgtcactc tagagaactc taatacatac 114360 
cagaaaaaat aaaatattta acctttaaaa 114420 
gagcccaatg aaaacaaaag tacttagttc 114480 
acctagtttg gaaatagcaa cctttaaaag 114540 
tttgaaaata tcccaggtgt ctaacaactc 114600 
tatcaaccat gacatttttc cgctgatctt 114 660 
ccctctgctg tgagattgct attttcctat 114720 
ctttcatgcc agcctaagtg gtttttgaat 114780 
tctatagcaa caatcctatg acagatgttt 114840 
gtcaatatgt tattttgaac atttcaaagg 114900 
caactttcaa gctcacaatt tgcattaata 114960 
taatagaagg ctgatgagct aattttaaga 115020 
accactctcg agaagaaagt gtaaggtcaa 115080 
actactttcc tcaatgttaa ttactttaat 115140 
aaaatctatg tgcatttttc tccaatgagt 115200 
ctcagtcctg gcatatgtag gtcactgacc 115260 
cttctttgct atttcttatc tgatctcgaa 115320 
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gttaaggcat tgaaactatt gcaaaaccct cattattttt tcccaaatta aaatggtaat 115380 
atttaatagc tgcatatttt gatgcatgct tttggctctg gtttcctttt tttatgtgtt 115440 
tttttttatt gttgttgtta tgaaggggtt ttttttattg cttttttgtt aaagaacgta 115500 
gcaaaaaaaa tgaagtatac taaaagctat ttcacccaga gaaacataat ttgttgggtt 115560 
tttaaaaata tcattttcat ctgcttcttt gggaaacatc agaaataggc actgagggat 115620 
ggcaaggttt atgctgaact ctaagtaatg catgttgtca cttgcagcca gtcactgaac 11568 0 
tttctctgcc gtagcttcctf catttttaat gtggactaag actccacttt tatatcacca 11574 0 
cgtccaagtt tagacacgtg accctaagtt gctctatcag ggatctctag gccttcccta 115800 
tgtgagacac aaggtgggta tggactgcag tcaaaggcaa gtgaggtttt tttttgtttt 115860 
tttgtttgtt tgttcatttt ttgagatgga gtctcgctct ttcaccaagg ctggagtaca 11592 0 
gtggcgtgat ctcagctcac tgcaaacttc gcctctgggt tcaagtgatt cccctgcctc 115980 
ggcttcctga gtagctggga ttataggtgc atgacatcat gcactgctaa tttttgtatt 11604 0 
tttagtagag acggggtttc accatgttgg tcaggctggt cttgaactcc tgaccttgcg 116100 
atccacccgc cttggcctcc caaagtgctg ggattacagg tgtgagtcac cgcgcccggc 116160 
tgcaagtgag tttttaatga atgcatggac ttgaggacag aagaggagcc cgaggtttct 116220 
agtgggggct atgcacagtt ctgtgggtta aaggttgttc ttcctctttc cctgagctgg 11628 0 
ggagctaggg caagctatgt cttatgtctc ccaagagtca aaacctttgg ctccttctaa 11634 0 
ataccagcct agctcatact aacactggat- gaatacattt ccacaacaga aactcaaaat 116400 
ccatctgttt aaggatgcat gacaaatttc atttagaaca acagtatgag agtcatattc 116460 
aaatgtgaag caagagtgag cacaaggctc ttatttgttc cttaagaaaa aaaaatttat 116520 
ttcttagtga gtttaaatca tcaaccatgc ttcaggaaga aaaatgtagc ctggcctcct 11658 0 
ttccatctct ttgtgattgt ggaaaagtgg acctttaaga tttcactgaa agattacaga 11664 0 
cctgctcatt aattttacat tccaaacctt tgactggtga taaaatgaac aaaagaattt 116700 
aaaataaagg tggaaaaaaa tggatccctt ttcatccgtg agggccctca ggctatatac 116760 
aaacacaata cacaaaacct gcatcattta cctgggacaa tgttatgggc tactaagtcg 116820 
atccttttag aagagcagtg agttaatgga ttgttttcct gacaatattt gtagtacagt 116880 
aaaaaataaa atggggaaaa tatttttatc tggcttttaa atgttatgtt ttatggttgg 11694 0 
gttaatgttg tggctgatca gctgttacat ggtgcatgaa tattgactgg attctccatt 117000 
agggctgatg gggcatcacc gaccagctat gtgatctatg ttctcccact ggctcttgct 117060 
tggcatgaga gatacatttg caggatgagg agcaggaatg agagctatcg attgtagaaa 117120 
cagaatatag tagagctgat tgacctgttt caaaattaag aaatctgttg ccatatcttg 11718 0 
cattacactc ttttatttta tattcttact ctgaattcct tcatgaaatg cacaaactcc 117240 
accctgtata tacatatata 'tatatgtttg catttcctag cttttaaaaa tagtttaaag 117300 
tataccttct ccaaagggga aatactttga aatatttttg gaagagaaat tgtattagtt 117360 
tagatccata ataaataatt atattgtttt aaaagctggt gcaaagtaac attttcttat 117 420 
attgtagaac tctaagcctt ttggcttaga aaatgagagc aatattcttc tgcttttcat 117 480 
cacttacagg gatattcttg gtattattat ataaatttta ttttcattaa atataccata 117540 
aaattccaaa tgtgtgtttc gaattactgg cagaaaaatg ttggtaattt tttccagtgt 117 600 
gacttaactg tcattataat taagtaattg tctccaccaa aatgtttgtt taggcaatac 117660 
atttctgcat atttgaaaac ctaataaaaa gttttattaa agctgttcac tcaaaaatag 117720 
acaactgtgc cccgctgagc agtagtttaa aaccaaatca atgaatttgc tattatgtag 117780 
acaagaaagt gtcctatcat tgacattagt aatgtttcca ccaagttgaa cgtgatgagg 117 840 
gagctgttag ttgttaccct ggaaactatt caaaaggtct agagctgctg aaaaacagaa 117 900 
gactctgggc tcacccctgg gattccgctg cctttcagac catctaggat ccaaattttg 117 960 
ggaatgtatt ctgtatgtct gaacatgctt tactgccatt ttagactgat tgtatctaat 118020 
tttaagttta tgatgccacc tctctttttc cctatatttc taaactttct ggaagacgtt 118080 
ctgaaatcat tcaagtttat tctcattttt tatatctcag gactagtcac tcaatcttta 118140 
aataaccact tttgatcaag agattactgt acccttctga tacaacttaa gatcataaaa 118200 
acaagttgat ccaattcaat agctttttct tatattataa ttcatgccct ggccgaataa 118260 
ttccaacttt ttcttttttt gaaatatatg ttaggttttc ctggttctca atcaatctac 118320 
gtgagcactc attctttctg tattctggtt cctactacat cttttatcag tgtctctttt 118380 
tgcattcatt gatcagtatc tcttttttta aatgcagaaa aaatagagtg tgcttataaa 1184 40 
tgtttaacta ttagccagct atttaacttc ctaacactat ctgatatgct tgttattaaa 118500 
tccaactcat tttctatgcc tttgattcgc cttatacact actaaggaca gaagagcaat 118560 
ggagtcgatc taaaaccaga gaaccagaat cccttactcc cacagagaga ttgcctgtgg 118620 
gctttaggca tcggtttact tgtctctaga gctcaagttc acaagcacca gtagccccat 118 680 
gggtagaatg tgaacctctg ttctatacag ca'tgttgatg aatgcattac ttctgcctta 118740 
gtcatgtcgg agcacagtaa atgtcaacta gcagtgcagc aecggaacac gacatgcaat 118800 
ctgctcacct gagtcatgct gctttcagaa gagaaggcac tggtttaaat ctgcatggca 1188 60 
agcctgcccg agctagaatt cgatgagaac cctcaggtta atagctttgc tctttgagag 118920 
agctctaagc tcttttcatt ccagaccaat ttcacattta catactacct gtcactaaag 118980 
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ccccaaggtc cttcgccaac aagaagcaga 
ctttctagcc tttggacaca gtttttaggc 
gccatttgtg ttcctaaaat agcaaggggc 
tttttttttt gttgttgttg ttgtttgttt 
tgtggtccag actggagtgc aatggtgcaa 
gttcaagtga ttctcctgcc tcaccctcct 
atgcccggcc agtttttgtar tttttagtag 
gtctcgaact cctgacctca ggtgatccat 
aggtgtgagc caccacgccc ggccctagaa 
attccgttta aaatgtcact ataaatctat 
tattccactt aaaagaaaaa caaatcagag 
gttagggtac acagcccact ttgggaccat 
aatgtgattt cttttattat tattattatt 
agaatatgca ggtttgttac ataggtatac 
tcctgtcatc tacattaagt atttctccta 
cgacaggccc cagtgtgtga tgttcccctc 
cccacctgtg agtgagaaca tgtggtgttt 
aatgatggtt tccaacccgg atgaagcttg 
gtgccgtcca tgagtgagtt aggatgtttt 
tcttccttaa gcataaaaag atactttatt 
tcctcaagag tggcttatct aaggcctcaa 
attatatgta ttgattaagc ttcatctatg 
tcagaaagag ggtgacactt cctgatggac 
agccccgagt tagactaact tgtgaaagag 
gacttggaaa gtgctgattt tgaagcccca 
tgactggccc cattttgctc ctaagcccac 
catgtaggcc aagctggcta cgggtggaat 
ttgtaataat tccttcccaa aactaaccac 
aagaacatta ggttaaagat tataagagca 
cagtccactc cattggaccc tcattgctgc 
gatttcaact ccttccctct ttcctcttcc 
ggatggttct ttaggacact aatcttccat 
gtcagcttcc ttgctgcaac atcttgtctc 
tatgggcttt gaactcagct acatgtctcc 
ccttctgttc acacctgtct ccatctcccg 
cagaggagat ggtgcgttcc ctcaagccat 
gatcctgggg ctaacaaaac atcaaagcct 
atacttctag ttccgttagc ttccagccaa 
aggagcacat ttttggagca cttagtatag 
ggcagacatt tatcatagca gttgtgtaag 
gaaacatata ccaaaaaatt ttaagtaata 
aagcaggagc ttagaaacaa gcttgtctga 
cagaatagag aatgttgctt gaaaacatag 
cattcttctt tgaattctcc acctagattt 
acacttaaaa gtattgttct tttaaggaca 
tcaaatttct tccttagtcc tacaatttct 
ggagtttact taagtcaatc aaccttcact 
tttcctttca tttagcttag ctcaaagtct 
ggagtctcgc tctgtcaccc aggctggagt 
tccgcctccg gggttcacgc cgttctccta 
gactacaggc gaccaccacc acgcctggct 
ttcaccttgt tagccaggat ggcctcgatc 
tccctaagca ggggtgtctt tatttattta 
tgagaaagag tcttgctctg tcgcccaggc 
gcaacctcca cctcccaggt tcaagcgatt 
ttacaggtgc ccgccaccat gtctggctaa 
atcatgctgg acatgctggc ctcacgacct 
ctcccaaagt gctgggatta caggcgtgag 
ttaatgccgc atcttcagtg cctaccatag 
ctcttccact tatccctggg ttaggcccaa 
gatcccagaa aaatctgttt tctctgcacc 



tatttactca ctaaagttaa tgtcccagcg 119040 
aacaaggaaa ataatacata tgtataaatg 119100 
tataacccaa agcctagaat tctttttcgt 119160 
gttttgtttt gttttgaaac agtcttactt 119220 
cctaggctca ctgcaaactc agcctccctg 119280 
gggaacctgc aattacaggc gccagtcacc 119340 
ggatggggtt tcaccctgtc gttcaagttg 1194 00 
cgtcttggcc tcccaaagtg ctaggattac 1194 60 
ttctttttct taaagacaat tataatggta 119520 
ttcatttatt actgcatgat tttaatctat 119580 
aaatcagcaa gggggcaata acaaatgtgg 119640 
attcaatgaa gaataagatg agagaaagat 119700 
attatacttt aatttctggg atacgtgtgc 119760 
acgtgccgtg gtggtttgct gcacccatca 119820 
atgctatcct tgccctagtc ccccaccccc 119880 
cctgtgtcct tgtgttctca ttgttcaact 119940 
ggttttctgt tcctgtgtta gtttgctgag 120000 
atttattaag accaatagta attacagagt 1200 60 
ctgtgacatg tgagagaaag cccaattcaa 120120 
ggcatgtatg agtaacagtt caggagtaga 120180 
agatcataat cagagtctaa tttaactcat 120240 
taagctctat acttaagctc cagttcctac 120300 
ataactgaaa ccccagaatg gggtcacaaa 120360 
cccaccttag aaccaatcat gatgcccaaa 1204 20 
tcatcgaagc agtgaaaagg aggcccggca 1204 80 
ctttaggtaa acggcttttg cttatctctg 120540 
ttggtttata gtttcacttt aaagcaaggt 120600 
ggaggaaaca agaagggcgt acacataatt 120660 
ttgtgacctg accaaggatg aagaacttca 120720 
ccagatatat gtggtcattg gttgcctcgt 120780 
ccaaacttaa atgttcctaa aattctatta 120840 
cttctcagtt tgcttgtcct ccaaaataaa 120900 
ttgacttatt tgctgttgtg tggcaagcag 120960 
aagtctgtga aaatgatgtc ctcttagagt 121020 
gggctggaga tgtagtcagc ttgacctatg 121080 
atgtgaattt cctgttgcca taagaagatt 121140 
attatagata atatttaatt aaatctatgg 121200 
ttgtcaatgc cacttcataa aaattaagac 1212 60 
actagatctt atgttaaatt atttagacta 121320 
taattataat tgtgtcttct tagagctgag 121380 
ttgacaaacc cacatagcca gtgagaggag 1214 40 
ctccagagac tgcagctttg cttataaaga 121500 
gttcaaatat atgtttcacc acttagtagc 121560 
gggatatttt cttctatagc taatgcctaa 121620 
ggcaccttca gagcctgtca gaagttgcga 121680 
attagcctct ttttctatgt cagttactct 121740 
tcaaaacttc ttccctggaa tctttttcct 121800 
atgcagtctt ttttttcttt tttttgagac 1218 60 
gcagtggcgc gatctcggct cactgcaagc 121920 
ttctcctgcc tcaacctccc gagaagctgg 121980 
aattttttgt atttttagta gagacggggt 122040 
ttctgacctt ctgatccgcc cgcctcagcc 122100 
tttatttgtt cattttattt tattgttttt 122160 
tggagtgcag tggcgtgatc tcagctcact 122220 
ctcccacttc agcctcccga gtagctggga 122280 
tttttgtatt tttagtagag atgacctttc 122340 
cctgacctca ggtgatctgc ccacctcggc 1224 00 
ccgctgcgtc cggccaaagc agggatgtct 1224 60 
agcccggcat gcaatagaag tgtgcgctaa 122520 
ctcctccagc tatagtagta gagtcaagga 122580 
caggctttta agtcatcatg agcaagactc 122640 
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tgccttttct tagtttttcc aggtggtgaa gatgtggaac aggaaaaacc acataaaaat 122700 
gcaatcacat gaataaatat tgcctacaat ggctggattc aactgaagaa cttacactct 122760 
aaatcaggag cacagactca tgtttatgcc tttgtcccag gcaccaactt ctcttgaact 122 820 
tctttgctga ccctgcctga aagtggagtt aactaaatct tgtgtagatc tgtctcatat 122880 
cactatttag cccatttgca gataggtatg atattgttaa aatgatgaaa atgccccacg 122 940 
ttggactcag ttggatattc tgctccggca gatcaggttc taactggaat cttggttcac 123000 
ttcacttttg taggctaaga'tattctattc gtacttgact gagcattgta ttgatgatcc 123060 
ctgataatgc agctcttcaa atgtaataaa tttcttttgt aattgactgg ccatgcccac 123120 
tcaagagata tatgtttgtt gcctgattga gtgtctaagt aaagaattaa atgaataaat 123180 
gaaagagtaa cataagaaat aaacaacact gggacaatga actgagactt catttattaa 123240 
gaagcaatta ttgataattt tgtatatgat tttagactat aactaatata ctccaaatgc 123300 
aatagtagac aagatagaaa tggtcccttc cttcatactc gtggttgaaa tacattgtat 123360 
tgctgatgtt ccatagcttt tttacacatg ttgttttcat ttgtttcaac taacggtaag 123420 
gagtgtggat atttgaatcg gccagtgcaa aatgcagtga cttggaaaga gggaatcact 1234 80 
gttagtgaat cataaacaat atttgaaggt ccttcagtgg tttattaact tcaaataata 123540 
taactagatg agtctcggtt ttacaagtaa ccaggaagta atgcaagaat cctagcgagg 123600 
gtggtatcaa aaacataaag tttatgtata aaaagcattt gaactcaaga atttgacata 123660 
gaccttagtg aaaatgcagc tatttcttgt cagtttaaaa gaaataaata atttgttaaa 123720 
ttcaaagcaa gggatcccta atcaacttag ctcacaggta atttcaaata aacataatgc 123780 
ttattaacag ttccaatatt tgactttttg ctcaacctaa attcagcaat gcaatgtgct 123840 
gagatatatc ccagtaaaga gcagctgctt. ttgttttgtt ttattttgtt ttaatttatt 123 900 
tcacaaacaa gtttctgcat gagcctgaat ctttacaatg tgaaagctgt ttggattcag 123 960 
gtaggaaata cgtttgaact gatgcatacc aaatttccaa aagttcacct ttttatctta 124 020 
ctgcttagct gttgaataac acaacttttg cttagtcaag ccttctttca ttttcttgaa 124 080 
gatgaaaatg taattcttga gttacttctg aagaacaggt tggggaaatg caaataagag 124140 
cattactctt ctagagtatg aaaagattac caaatttagg ttttctggtc agtaaaatta 124200 
tattttccta acttgtttac taattacaga gtcatttaat tagattatca tttcttcttt 124 260 
ttccattgcc atctttttaa atgatgatat ttccagaact gctctattta actgttcagt 124 320 
gaaattacat tcatgaagct gacaattatt cagaagtaag tttcctatgt aactctccca 124 380. 
ttcttcatat tctttgtttc taatttatat ttgggtctga aaattgataa agttactagt 124440, 
ttagaaataa ttaactgtga cttcttattt tcagaaacat aaatgagggt gcacttctcc 124 500 
gcagaacctt tttcacattc ttttaacaac caaatcattc ccaattttct ggtgtgtaaa 124560 
ttccatgtga tcagtacaaa taaacaaact gacaagattt gggcagttgg aacccattgt 124 620 
tttacaacgt ctttgtttaa aatctttcat aattcatctt cagcaatggc cttcctagca 124 680: 
attagttacc tttagtgtct tcattttcgt ttaaatgttc aatgatcagg ataattagaa 124740 
ttaatttctc attgtcattt tatggtaatg aagcttctgc cacttatcac aagcatcctg 124 800 
caggcacagt gttaccacac aaatatgtga ctcctcactc ttcacacact ggtttatggt 124 8 60 
ccctggttaa agtggtaaca gtaggacctg gcctgattcc agtttaaatg tatgccatat 124 920 
tagatgaaag attgtgggta gtcatgctgg tattcttgcc ccttgtggat ggcattctat 124 980 
gtgcaactca gttctcctta agcaacacaa gagaattttc gtctccatta gtctgggtgt 125040 
tttacagact gggattagtt ctacagttag aacaaaagga aatatctttg tgaatgaaga 125100 
tggaaggcaa tgtcaattca aaactttata aaaacttcag atagaattca ggaaagcaaa 125160 
ggttgaaaaa cttagtacca gaaaatctat gaagaagaaa gtgagtcagt gaattctcaa 125220 
gctggaagag ccttcttgca ttatctggtc taggattctc cagccacaaa aaaaaaaaaa 125280 
aagtaattgt gtattatcat ccccattcat tccctgtcat atctctgcct aagcaattgc 12534 0 
agtagaattc atcctatcat ttgtatcctc ttccttactc ctgggaaaca tttagatagg 125400 
gtaaaggatt tggttgagca gaaacctggg gtgggaaata gggagcatat ctaggtttgt 1254 60 
caaagaaggt atttaaggct gccctgtgaa atcataccag tttcaaaatt agaaacttga 125520 
tcttttttac agctgacttt tcttcaccac attttaaact tgttctttat tctgcatttt 125580 
ctcaagaagg gatatgatat taccaagaga aatacttgct ctcaataacc ccaacctcat 12564 0 
.tttcttatga gtttgggtaa gttagaaaga aggtcaaact tagtagctta aagaaaaaaa 125700 
cacattttct tttgcttata attgttgagg aaggaaattg gaaagggcta ggcagggcag 125760 
aattggaagt tttcctgtgt gtctggcaga tgttggctag tgctgcaatt atgtggagtc 125820 
tatttaaact ggccatccta tggcttcttc atgtggctgg tggtgaatgg tggctgttgg 125880 
ctgggaactc agccggggat gacaagtgga tcacctccat gtgtctttgt agcacagcag 12594 0 
tttcaggctt ttagaatttc ttccagtggc tggcatatac ctgaaagaat gtcccaagag 12 6000 
agtctaaggc agaaatgtgt ggcttattta taactagttt tggaagttac aatgagacat 12 6060 
ttctgctgca gtttactgat cacaagagag ttactaagat tgatcgagtc tctacccatg 12 6120 
tgagggaggt gaatcaaaga acttgttgga caggttttag acctgtcata gagtgtaaaa 12 6180 
aagctaagag gcttcatatt caagggaaga aatactttaa agttctccag ggatgaggct 12 624 0 
agtagtgatg attcagggca agaccagtga agtaagagag gctgggacat gcattgaaga 12 6300 
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agctcactga 

cttgcaagat 

ttgatagttt 

aaataagggt 

ggcaacttag 

actttggggt 

tattgctttt 

aacactggct 

tagagggaaa 

aaaattgagt 

gacactttgg 

cataatgttt 

gacaagtcac 

atctttatag 

tcattagcca 

taagttggtg 

gttaacaaag 

tttcacaagg 

gcttcttgcc 

aaggtggcct 

ttatcagagt 

aattttgaac 

aataaaaaat 

ttcagagttg 

tgattggcaa 

cttagatacc 

attctgatgt 

atctttgtta 

ggaaggtaga 

actttgaaag 

attattagag 

cttagtc.tat 

tgaccaaagg 

ccnagaccat 

atcgcagacc 

ccaccacgac 

accttggtag 

agccaccaca 

ccctgagacc 

gcacacatct 

caggggttcc 

ccagggtgct 

aagaagattg 

gcctctttc'c 

gcctgtccca^ 

.tagggaacaa 

tgatatttag 

cttttggggc 

gggctaggga 

tttgattttc 

gccctcatga 

tccaacatgg 

ccacaccaga 

taagaaataa 

ccagaagagt 

tattgttaca 

tgtgtatgag 

aaagaaaatg 

agggttttca 

actgtctttt 

atgactttat 



gaaagttctc 

ccagttactc 

gaaaatgact 

gttttttccc 

aaactacttt 

t aaaaccact 

gggtgaatga" 

gtttcagaac 

gtgctggtaa 

gtgtgaacca 

ttatacaact 

aaatgggtta 

ttaaattttc 

agagttgttt 

tgagtttagt 

ttgatgggaa 

ggcaatgctg 

aagcagaggg 

ggcaaagctt 

gtcagagggg 

gagaatgtga 

attttacagt 

aaatatattt 

atattttaaa 

ttctggaaag 

acctaagtca 

gttcccagat 

tgaagtgtca 

acttgtgagt 

agtagcctcc 

gatggaattg 

ccatattgta 

acaatctaac 

ggaaggataa 

caaaatgcaa 

cccgtgctgc 

ccccagattc 

gtgaacttct 

tcagacctgg 

ccaatccatg 

cagagccttg 

cagaagggac 

ttattgagac 

cctttcctct 

tcattgtatt 

tttgcctcaa 

atgagatttt 

tactgggatg 

cagaatgctg 

caaagtgata 

atgggattac 

gaagatgtag 

ctccaaattt 

atttctgttt 

ccaagacgtg 

aatggcaaac 

taagtttaaa 

aacaactgag 

aaaataatga 

agatatccta 

tttcctgcac 



tggtaatggg 
acttgttttc 
acagtggaat 
caggagaccc 
taggagaact 
ctgcttttgt 
cattttgggg 
agcacatgtt 
ttagcagaaa 
agcacatttg 
gcatgataat 
tagatcaaat 
tgatactcag 
cagtgagtgc 
aaatatttcc 
ataagcaatg 
tataagtgca 
ctggctttca 
tctcgctctg 
gtaaaatata 
gtagaaagca 
attaacaaaa 
tatgttatgt 
ataataaatg 
gactcagaaa 
tcattaatag 
cctactgggg 
cataacttgg 
gatgaaattg 
ttcctctttg 
ttaatcaaaa 
aaaaatgagg 
agaattactc 
tcctaaaagc 
aagtcggggc 
ttcacattgt 
agttctggtg 
aataggccaa 
agagccacgg 
agagctgcat 
gaggtccaaa 
ttttccccaa 
ttatgctata 
ttcctgtttc 
ttggaagcac 
aatgaatctt 
ggactttgaa 
gaacgaatgt 
tggtctgaat 
gtattaagtg 
tgcacttaca 
gtgaaatatg 
cctggtgcct 
ttaatgaagc 
gtcctacaat 
aaaaggtcat 
acaacaattt 
agtaataaaa 
gttggcttca 
aaatatcatc 
agagaaagga 



ctgcaatgtg 
tcaattccag 
attta caeca 
agttgctgaa 
ggactgtctc 
aaccttttca 
gataatctaa 
gaagcaggag 
ttacaggaag 
agtttccaga 
ttatcattta 
gcaaatattt 
attcttcatc 
tgtaagatta 
cagtggtgga 
attaaacaaa 
aaaaaaaagg 
ttcatgcccc 
acagaaggca 
tagttttgea 
gagattttaa 
atgtttagaa 
atgaataaca 
ttttgttcta 
gtaaaaaagg 
ctattggtag 
agaatgggat 
ctgagttgtg 
tatatttggc 
ctgctcctag 
ggaaagcaaa 
aagtatcttt 
agtcatttaa 
aattcaaaaa 
agcagggtgg 
ggatctacct 
aactctggca 
tgcccagcag 
gcataaaatt 
tgtaagttgc 
ctctgggggg 
agtgggttgg 
atgttttctc 
tatgttttag 
ataacttgtt 
accttgagtc 
ctattgagtt 
ecttgettgt 
gttcatgtcc 
gtggggcttt 
aaagaagece 
acatgetttt 
tgatcgcaaa 
tgtgcaattt 
gaagttagag 
tatgettaca 
tttttaatat 
cctttgttct 
gtttggaatt 
tgttttgaga 
gacctaagaa 



ctggagttag 
ctctgagttc 
cattaattgg 
tagttaccaa 
atagcaattc 
gtaaaatttg 
tgtgattctg 
ectgatgeag 
ctctgtactt 
gaaggaatga 
cagtatactt 
aaatattttg 
tctaggaatt 
aaaagaaaaa 
acaccaattc 
ggaaaatcag 
ggaaaaccac 
actccagcct 
agcaaacccc 
attgtataat 
taatatgacc 
ctttagaata 
taaacaaatg 
gtccatttaa 
agagtaggga 
aaatatgggt 
tggaaaatgg 
ttcctgtcct 
tgeagcaett 
taaaatggga 
acttaaatat 
gcaagagaag 
cagaatctag 
ctatcagaac 
tttccaattt 
cccttactct 
gggaattgtt 
agecacaaga 
cttggctgga 
atctggcaaa 
caaagctaag 
aagggcagag 
tgttgggctg 
aatggggatg 
tgatttcata 
tcattcatac 
gatgttggac 
gagaaagaca 
cctcaaaata 
tagaaggtga 
aaggaaaacc 
etgtgaacct 
ccttttagcc 
atgatacttg 
aattttttta 
cacacaataa 
gaaagtccca 
tgttttgaca 
agggaattac 
tacacaacat 
ggattatttt 



tccacaatgg 
agtgatgtca 
cagactctac 
ttggtaatgg 
tatgttgtga 
cga<ggttccc 
aatttgagtc 
aataegtcat 
caaagacttt 
agctccagtt 
ttatgtacct 
catgacattg 
agctgaaagt 
taagagacca 
caagtttatg 
gagaaagatt 
agttttttga 
cagttccatt 
aagaggcccc 
attaatgttc 
caaggaagtc 
t attta act c 
tttggattag 
ttatgggtct 
aagcctcaag 
ggtgaagacc 
agaaaagtct 
gatattttgg 
tctaaacaac 
gaagagaaaa 
ctggaaaatt 
accaagggtg 
aacctgttgt 
tgccacccct 
aaagaaggga 
gttaccatgc 
aggggggcag 
gtagggctgc 
agatccacag 
gtcatgtagc 
ggagtaccac 
catcaaaccg 
tggacttagg 
catgttctgt 
ggttcacagc 
ctaacttaga 
caagttaaga 
tgaattttga 
tgtataaaaa 
ttatggtgga 
cctttctcct 
aggtaaggat 
tccagaacta 
gttatagcag 
agtgtctgtt 
atgtgtacaa 
gttagtttta 
gtgatgggga 
ctaccataaa 
etaaaggeca 
aggtttggat 



126360 
126420 
126480 
126540 
126600 
126660 
126720 
126780 
126840 
126900 
126960 
127020 
127080 
127140 
127200 
127260 
127320 
127380 
127440 
127500 
127560 
127620 
127680 
127740 
127800 
127860 
127920 
127980 
128040 
128100 
128160 
128220 
128280 
128340 
128400 
128460 
128520 
128580 
128640 
128700 
128760 
128820 
128880 
128940 
129000 
129060 
129120 
129180 
129240 
129300 
129360 
129420 
129480 
129540 
129600 
129660 
129720 
129780 
129840 
129900 
129960 
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tcagaaatgg ccattgtgta aagtatgtga 
aaataattgc ttatagtacc tgattttttt 
ttcttatatc tttaaaattt ttacaagata 
ttattttcaa attataaaag tatatagatt 
tggtcaaaac acttatattt ttatataata 
tgtgaaacct tttagatatt tgcattattg 
tatatacagt ttttttatatr tagcttgaat 
agtccattct cactctgcta ataaagactt 
aggtttaatt gactcacagt tcagcgtggc 
tggaagggga aacaaacacg tcgttcttca 
aagaggggaa agccccttgt aataccatca 
aaacagcagc atgagggtaa ccagccccat 
acaacacatg gggataatag gaattacact 
actatatcat ttcatccctg ccctctttca 
tcatgccctc ccatcagtcc ctcaaagtct 
aagtccaaag tctcatctga gacaaggcaa 
aaagcgagtt agttacttac tagatacaat 
tatcaattgg gagaaattgg caaaaacaga 
ccaatagggc agtcattaaa ccttaaagtt 
tccaggtcat attggtgcaa gaggtgggct 
gctttgcagg gtacagtccc cctccccgct 
gcctttccag atgcacggtg caagctgttt 
gggtggccct cttctcaaag cttcaatagg 
ctctgacccc acatttcctt tccacactgc. 
cctgtagcaa acttctccct ggatatccca 
agaggttccg aaacctccaa tcttgtcttc 
agctgccaag gcttggggct tgcactgtct 
cgtttagtca cagctggcat tgagccagtt 
acagagcaag ggggccctgg ttttggttca 
gcttgtgatg ggagcggct.g ccatgaaggt 
ttttcttggt gattcacatt tggcttcttg 
aatttctctc cagaaaatgg. gtttttcttt 
aacttttatg ctctgattac tcttgaatgc 
accctaaatc atctctctca ggttcaaagt 
ctgccagtct ccttgctgaa gcatagcaag 
cctcatcttc atctgagatg accacttcac 
agccattcaa caagtttcta ggaagttcca 
cctccaagtc tctagaaagt tccaaacttt 
ccgaattgtt ccaacctgtt cttgttaccc 
atccttatag ccccactccc ttggtactaa 
aataaagacg caccccaaac tgggtaattt 
tcctgaggct gggaatgtct caggaagctt 
tccttgttca catggcagca agaaggagaa 
aaaaccatca gatctcacga ggactcactc 
cccatgattt aattacctgc taccgggttt 
acaattcaag atgagatttg ggtggggaca 
taagttgcct cactttaata acaaggttga 
tatgtgtgtg tgtgtgtgtg tgtgtgtata 
ttaagtgttc catgtactac tgtattgagc 
aggtgaataa atgagcggat aaaatatgga 
acttatcttt gagaagcatc ttttgccttt 
gtaacttttg tatatgttat tatatgtgtt 
tagcaaattt tattgagaat atttttttct 
aaacagtttt tatagctctc ttttgaacta 
tctctgatca tgaagtacct gcaaactgct 
attaccagga tggctgcccc aaattgttgc 
ggagcctaca gccttgtaaa aataggaaca 
cctgatttat gttttcttgg ttgtttgcag 
catgtgtggc tcaggctgac tccacaatca 
tcttcttaga agatgtttgt tctgacataa 
aggcttctcc ttgcacaatg atctcta.ttt 



gagcagattt gaatatttgc catagtgcag 130020 
aagttattat ttcatccata atatatcttt 130080 
aaatgaaaat tgaactaact tagacaaagg 13014 0 
tcaaaatgtt tattgaatac ttatgcacaa 130200 
cactatgatt tttagcattc tgagcaaata 130260 
ttaacaaaga agttagtgaa acatgtcaca 130320 
ttcttaaaat cagcctttac caggtatatt 130380 
acctgagact gggtaattta taaagaaaag 13044 0 
ttgggaggcc tcaggaaact tacatcatgg 130500 
catggcaaca ggaaggagaa gtgctgagca 130560 
gatcttatga gaactcattt gctatcatga 130620 
gattcaatta cttcccaccg ggtccctccc 13068 0 
caagatgaga tgtgggtgtg gacacgccaa 13074 0 
agtctcatgt cctcacattt caaaacgcaa 130800 
taactcattc cagcattaac tacaaagtcc 130860 
taccctttca gctatgggcc tgtaaaatca 130920 
gagggtacag gcaatgggta aatacatcca 130980 
gaggccacag gccccatgcc agtccaaaat 13104 0 
ccaaaatgat ctttgactcc acgtttcacg 131100 
cccacagcct tgggcagctc cacccctgtg 131160 
gctttcatag gctggcattg agttcctgca 131220 
ggtggatcta ccattctggg atctgaagga 131280 
cagtgcccca gtggggactc tgtgtggggg 13134 0 
cctagcagag ttcttcatga g^gctccatg 131400 
gcatttccat acatcctctg aaacctaggc 131460 
tgtgcatcca ctggcccaac accacgtgga 131520 
gaagcaacag actgagctgt accttggccc 13158 0 
gggactcagg gcaccatgtc ttgaagctgc 13164 0 
ataaggcatt ttttcatttt aggccttcag 131700 
ctctgacatg tcctggagac attttcccca 131760 
ttacttatgc aaattcctgc agtgtgcttg 131820 
tttattgcgt cttcaggctg caatttttta 13188 0 
cttgttgctt agaaatttat tctgccagat 13194 0 
tccacagatc tctaggtcag gggcaaaaag 132000 
agtcaccttt gctccagttc ccaacaagtt 132060 
tgtccatatc actatcagca ttttggtcaa 132120 
aattttccca catcttccta tcttctgagc 13218 0 
tccaagtttt cttgtcttct tctgagccct 13224 0 
agttccaaag tggtttccat atttttgggt 132300 
tttactgcat cagtttgttc tcacactgct 132360 
ataaaggaaa gaggtttaat tgaatcacag 132420 
acattcatgg cagatgggga agcaaacacg 1324 8 0 
atactgaata aagagagaaa agccccttac 13254 0 
actatcataa gaacagcctg aggaaccacc 132600 
ctcccttgac acgtggggat tattggaact 132660 
tggccaaacc atatcaccaa gatataaaat 132720 
gttacatttt aaaattatta ttaaagtata 132780 
tatatataca tatatgtctt taaagttatg 1328 4 0 
acatagaggt gctcagtaaa tgtcaatgtt 132900 
ccaacttagc tggtcagtga gtttatggtt 132960 
ctccatattg agattgttta attgagctgt 133020 
taaattttgt atatattttg tctaccttag 133080 
caaaatagca tccctttaaa ctagaaaaaa 133140 
aatactaaag aataaattgg tcaagttctt 133200 
tcccataaat ggcagaaaag gttaagagac 1332 60 
tggtatcatt aaaatagtca acaaaaatgt 133320 
tttaaatcca tactttagtc atttcctctc 133380 
gcgcagtcaa cagcagtgga agagctggct 1334 4 0 
gagtgattat gtggaaatca atcagaattt 133500 
tgactgacat tccagctaat ttttattcat 133560 
ctctcaatgt tattctgtgt ctttcttgct 133620 
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tgcctgcaca ttctgaaaaa aaaaggaaag 
tgttgtttgc taacatattc agaacacaat 
tatattagga aaacatcaat taggtaccac 
aaatgtggtg aaaatagagg ggcaatgata 
tcattatgcc aaaattattt tgtggttcag 
taaactttaa ttaaatttat tgtagtttaa 
aaacaccaag ccaagctctt - tcctagttct 
attattgtaa tttttaaacc accgagttct 
cctgaaaagg accattcttt ctggcactga 
atgtagtttg tgctgtctgc cacagattta 
cccttactaa ttgccagcct attgctgata 
taccactata ggcatttaaa ggatagctga 
tatctttagt ttatcctaaa gtagcagcta 
taaccagaga aagtaaaata tcacagagta 
gctcaagtaa tgaatcagga tgctattaca 
ttcctaaatt ttaagctttt aacaaaagca 
ggaaaggaaa ttatcattta ctgtctttga 
attgatcaat ttataataaa ggtaagagaa 
ttttaacttt cataattaaa atactgggta 
tatatttgga aacaattata ctttatatac 
gaaattgaag ttcacaataa tgactgaaca 
aaaatttcca tggcaccttt ctctttccat 
gcttatatgt ctagactcta cccctcattc 
attaacccac tctcgatttt tctgttgcac 
aaatgcatga gctacctggc cgagtaatct 
ggccaatgta aagttatgtt gtaactacct 
taaggaagag aagaagccaa tcttgaatca 
tacaagcaaa aacccatttg attgaacaag 
cttgtcacaa aaagtgtaaa gggaaagtaa 
atattaaaga ttcaaagcag ggatataagg 
tataaatgcc ccaagaatga attcgatgtt 
tatttttaaa gtcaacaggc tgcccatttt 
caagtactct gatgagttct ggagatgcta 
aaaatgtagt tcacaattta gccatggagt 
ccctcctcca tccctctccc agacacccat 
ttttgtcctg tctttggtgc actatttata 
ggtagtaaga acatcagtat ggagacccct 
tggtattaat tcttgtttat tgaacttgag 
tactggtgaa cattacttta ttaatgatta 
tagctgttct ctccttgtag cctgtaaact 
gattttgctt aaattaaaat taatacatca 
tcagttaatt tagtttttag aaatctttta 
caaggatttt tgggatatta aaattgtcac 
tttttctttt ctcttcaaag tgttgggaga 
agatgttatc attaaacatt tcagaaaatg 
tgatcagcta gtagaggttt tggtgaacat 
tatgtgaaat ctattgcttt tccaaaagaa 
actttcatgt aagaaaaaat aaaaacataa 
aaacacttga ccacatgttt gagtacccca 
ggatgatcag tcctcatggc cttactactc 
gttcacttgc cttgtcagtt aagtttctaa 
ttaaccacag ttaattgctt aatagactct 
tgcttggaga aaatgcatta tatattttat 
caaaggcact ttcaacagtg gccagggcca 
gccatgtgtt caatctagta ctttgggatg 
cttctcctta gagcttaatt cattattatt 
aaaataattt taggagtgct ttggcaaaag 
taaaaaaaaa aaaaagacta gnnttttttt 
atttaggttt taaaaatgta tttcctagcc 
ctaaattttc tctgaaaaca tgagacgcac 
ttttaataga ttttttatgt ttgaagagat 



caattcccta cattttattt tgatacttaa 133680 
gcaactttat atattagagc acttataata 133740 
cagttatttt agacaggtat ctcttaggta 133800 
gaattcatat gtgaaagagt caattggcca 133860 
gggacatttt aaactttaat ttttaagaat 133920 
atgcaatcac aaggaaagac tctgaactgg 133980 
ggtcccctat aaaccctgaa agttaatgca 134 040 
cagggtgatt agttatgcag caatagatag 134100 
aaattaagaa gagtgattaa caacataaga 134160 
tgttgtagct tcagtgattg cttcaaagca 134 220 
cctagctaaa ttagcatgtg ttagttttga 134 280 
tacattctat ataaaatata attattttcc 134340 
ttatattcgc ttcttgatta gctttataca 134 400 
tatttgaaga cataaaagtc tttggacaaa 134460 
tcctccacat gacagtaaat at at tt tat a 134520 
aattttatac tggactattt caatttctag 134580 
ggctataatc tatttagcca ttatttttat 134 640 
aatactacca catatatgtt gtttatataa 134 700 
tactgcaaga .tgtcaatatt tataatagct 134760 
tcctaatcca. ttgccttgtg tgtcattgaa 134 820 
aatgctcaaa gctggtctgt ggcagaacca 134880 
attggggtga tctgcctaat ttgactttct 134940 
cttctgctag tgtgttgttg aac'cctttct 135000 
aatcaatctg tagcctgaat gttgagtccc .135060 
aactagatta gcctgagtat atcactttaa 135120 
catagttttg cagaggcaag aaatagaagt 135180 
gttagaattt ccttggacat agaaaatctc 135240 
caagcttatg cagttaaata tatcaaagat 135300 
aataaggagg gagcatgaag aagccagctt 135360 
aaattcaagc caaaaataag gaagaattct 135420 
tggtcgatat tccaacagat ttagtaatgc 135480 
tgcctcacac tctgactgca tgtaatctgt 135540 
atgcaaaaaa gatataaaat ccttaccttg 135600 
tatggcaagc atcttgtctt caattttagc 135 660 
tgtcctcttt ctcaccacat ccaccccctt 135720 
attcttctca tgtctaccac aactattttg 135780 
taatcctaac taatatcaac tctatctgtt 135840 
gcattgagta ctacaaaaat tataatacac 135900 
ggttccactt ggatgtttgt cattcgtatg 135960 
ttctatggat gtcacaaaaa gagttttaag 136020 
agaaaatact atgcctcaaa caaaaaataa 136080 
caaaaaacgt gatatttaat aatctttatg 136140 
agatttaata atacgtttta ttgtataaaa 136200 
gtgaattctc taagtaaaaa ctgaagagta 136260 
cacactatcg tagactgcgc actgtgaaaa 136320 
atagtaactt ttatcaacaa gtaatgtgaa 136380 
agaaactttt acacttcttg gtaaagataa 136440 
ttttagcaat ccaaaccata ctaaaatcat 136500 
aaaattgttc ctgtgtggat tcttcttcta 136560 
tccatttctg tcccactgag gttagccaaa 136620 
atcagatgag ttatgttttc tataattgag 136680 
gtctacacta actgaaaaaa aagaattaat 136740 
tcacatctac agtcatccag gccaaagtga 136800 
atggcgacag cggggccagt gcttagctca 1368 60 
tggatataaa gaaaatgagt tataaatgaa 136920 
attattatta ttattatttc tttacataag 136980 
tttaggtaca atttaagaaa atgccatttg 137040 
ttaattaagt ttttactttt aatgatttag 137100 
aaggtctaac accaggtatc tgagattttt 137160 
gtacagttat ttaaaatttg ctgcagccat 137220 
gccaatcgct tattatagta agctatgata 137280 



72 



WO 01/40493 



PCT/IBOO/01507 



gcaatgccaa tgaatcctca attccattgc ttaaatataa gtgaaataat aaaggaaaag 137340 
accagtattg aatatattgc ctcttcctgc tacactctaa catattcact acaataaatg 137400 
gaaagttgca catacaataa aaacttaggc aatttattta tttacttatt ttttatttat 1374 60 
ttttttgata tggagtctcg ctctgtcgcc caggctggag tgcagtggtg cgatctcggc 137520 
tcactgcaag ctccgcctcc cgggttcacg ccattctcct gcctcagcct ccctagaagc 137580 
tgggactaca ggcgcccgcc accacgcccg gctaattttt tgtatttcta gtagagacgg 137640 
ggtttcaccg tgttaaccag* gatggtctcg atctcctgac ctcctgatcc acccacctcg 137700 
gcctcccaaa gtgctgggat tacaggcgtg agccactgca cccggccggt aatttattta 1377 60 
agtataattg ataatcaggg caatatcaga aaatcttgaa taatttgtac tctaaaat.ga 137820 
atttatgctt ttcagtgctg gaagcattat agaggttgtt tcaactatga aatcattttg 137880 
tcaaacgttc tttgaagtct atgaccttaa gtaaagcatt taaaacccaa tggtttagaa 137940 
acgtccagag aaacaagtat attggcaatt gtaaataatt tttgaagtat ttagcaaaac 138000 
agcgaagtac aataagcatc ctattcacaa ttccaatgta aaaagagatc tggtcatagg 138060 
cagaattata ctaagaaaaa ccctatgaag aaaataaagc ttgtttgtta cctccaagat 138120 
ttcaccatct ttttcccgtg tccctgcatc tatgccctac tgtttctgct tcctcattaa 138180 
tcttaaatcg acttaagtgg atactggttg tttaaatatt tcttatctgt atcacaagtt 138240 
caacaaggaa aatatcttga taataaaaaa agccaaagga agcttttaaa aatttaccca 138300 
tggattgtaa ttacccatga tccccatcct attcctttag aactgttaga taagagtttt 138360 
ctacacttga aactttgagt gtttctaaaa atatttcctg ctttgagctt accaagttga 138420 
tagataccgt ccagacaggg tgaactgaag caagagtccc tggctcgaaa gtgtgcctgg 138480 
catgttctaa taaaacaaaa aggccgcacg ggtgctccta aaagggaata aagtcagcaa 138540 
gtggggagag gggcatgtat gacttcttgt tgattattag aagaacattg agaaatgcaa 138600 
aaccactttg tggttttgtg aatccactgc tgttttcagg atacactgcg aggggtaatg 138660 
gcacaaaagg gaaatctcct ggttgatagt ttggtaatct agacagaagg tggtgttggc 1387 20 
tagaatcaag gtaatagcaa gtcgagttat cagaaatgtg gaaattggtt ttatattgag 138780 
aaacaagccg atggtgctgc tgagaattgc acgtgtagtg agggagaaca aagggactca 1388 40 
agagtatatt tttttaacca aagcagctag aaagatgaca ttgcctttgt acccttcatt 138 900 
ctctaattgt attccccatc tctgccttca ttctctgatt acctttccca acaggtagct 138960 
ttagcttgtc cacacctaac tatgcttctg tctagggcct tgacacttcg cgattccttc 139020 
cacctggaat gcagttgccc aagatatcct catcaattat ctctcatatt tttcaagttt 139080 
ttatttgaat gtcaccttt.t caatacaact tgccttgccc cccatcttag attgtatccc 139140 
cctaaataca cttcctgtct accttctccg attcgttttt ctccatagcg cttatcccaa 139200 
tctaacaaac atattttact tattttattt attgcctatt tccccagtaa aaaggcagct 139260 
caataacaag gatgtctgct tgttttgttc actactctct ctccaaagcc tggaatagta 139320 
tttggcatat aatagcacaa taaatatggg ttagatgcat gaattaatgc agggaattag 139380 
ccaagctatt tactgtactc tggtctctga gaaaatttgc aagcaggaga aagattgctc 1394 40 
tatg'gaaaaa cgtatgaatc aaagcaaaaa actccatcat ggtgtatcca gatgaaagtt 139500 
gtttattgta atttagtttg tgataaagtc ttcatcataa ggaggatgaa aacaaaagcc 139560 
ctagtagtat cagatcctca gagaagagct aataagctga gctccttaac atgactcatg 139620 
ctttctctaa cttgggggaa tggttggccc caactctcac tgagtctgca cactgaggaa 139680 
aactcctcaa aatctagcag cctggaagga tgaatgcaaa agaacagtaa caccaatatt 1397 40 
ccctctaccc agctgacatt tctctcccat ccttcatttc cttaccataa ccctctgtgc 139800 
tagcccattg ttcatggata atgtccaaga atgtacaatg atatacaaga tcatgataat 1398 60 
tgaattccaa cttctgtcat ttttcttcgt gcccccccat atccagtgat tctgaactgc 139920 
tacagttctg taaaccaaac atcccttttc cccaatttta taactaaacg cttttacttt 139980 
tttattcttc ctagaatagc ctttcttatc tcatatcttt gtaaatttta attcatcctt 140040 
taataatcat ttatagcagt tcttcattag tgaaattcct gatacatttc ccaccctctc 140100 
tgaaaaatta attgctccct tattgatttc ccattgattc taatagttat gaaatatttt 140160 
attacaatta ctggttcaca catctgaatt ctgcttggag ctgtgaagaa tctgaaggca 140220 
agaaacatat tctactttct ttgtataaaa tggacttatc acataataat ggctcattag 140280 
ggtgaatgaa tgattaatat taacaaagaa aattgtcaaa attaggtaag cttcttttaa 140340 
aaagttgaag ttaaacatta gccatatgtt ttctggaaaa ctgagcactt tctgatgtgt 1404 00 
ttgaaatcac attgagaaag tcgtctcttt tgctactttt gtttactgga atgaagggta 1404 60 
actggaaaga tttcccagag aaaacaagac cacaaatgtt ttgaagtagg gaaaaatatt 140520 
ttaattgaaa tagaggtggt acaaagacaa gaaatatgac atgtgataca gtagatataa 140580 
gcgtatatga aatatatggg atgaacatcc atacaggttt cacaggactg aaaggtgcaa 140640 
ctggtacaga gagtctggtt gttgaagtat ttaaaactca caatcaaagc tctgatagta 140700 
ggatgcaatg tcaatgatta ctaatcgaga caaatttgct caagatcatt gtataattac 140760 
acccaaattc caatttgtga atagatttac tcatcgtgac ctagctgata acatggaaag 140820 
ataaacaagg cagatacaca gaacattttg cagactatgg tggctgcaag aatagctttt 140880 
cgcttgattc aggctttaag tgttagcctg aacactcaga acgcgattta ttgttggaaa 140940 
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gggattaacg aagacattta tttcagaggc 
■gttgactttt ttgttgttgt tcttccctct 
ggcttgtcaa tggggaggtg ttcgttcctg 
atgaatctat tacacagaat aaggaaatac 
ttgatgagca gtacactgca acatagctca 
caaatatgtt attcatattt gctgagagtt 
agctcagtgg aaagatatgt" gtttatctga 
tgtacaaatc tgtttttaac attgatatta 
atgcaaagtt tggaacttgt ttcacatttt 
atatacgaga ttttaaaagt ctatagggat 
tttatttatg gaaggggatt gcttgatcct 
tgacatgata taaattggtt tagagaaaac 
tgatgctgtt ggagcatagt aataacgtaa 
tctagaacta gatgggtttg tcctggatca 
caggggtgat ttttgacttg tataaaacca 
tcggtttaca atctttggaa atgtgtattt 
tcaagaaaag ttgtttttaa tgctgctctg 
gcaagcaagt cgtttttgtg ttttacttgc 
gaagaatacc tgagtttagg accatacatt 
gtatatatag aaaagtgcac aaatcagaag 
agccatgtaa ctagcatgcc aataaaataa 
cttctcatgc ctcatcccaa aggtaattgc 
ttggcctatt ctcttgaaca caactcttca 
gcaagtggaa tttgccttct cccactagtc 
tcagaggaaa tctaagcact ttaagtggtt 
■aaagaaagct ggaagttgat aaagtgatcc 
tggccatgtt gttcctgcca caatgtttcc 
acagtaaaag tttaagtaat gccatattcc 
aataggtagc. aaattttaaa gatcagagaa 
tggttagaga tctttatata tcaacatatt 
gacgtttatt tttaacaatt agtattctat 
gcttaatgag ttcttactgg aaggggacta 
ttcactcaag ctattagcag gaaagattgt 
caattgaact ttcacttgga tgaaacaaga 
gtctttctcc cttttcctta tgctatttcc 
tttgaataat ctatgattat tttccttctt 
taagcagcta ccaagtcctt aagctagatg 
tcagtgtagt cgatggtatg accctgttac 
cacaatttgc tcattatcaa actctcataa 
catatgactc ttaacctcta ctgaaagatg 
gatataatat attggctggt gaagttttat 
tttctcagtg aaaatagtga acactattca 
catgagagag tcagtgtagg tagttcagtg 
caacagtgac atctgtgagt ttgttacaca 
tgaatatcca agcatgcgat ccaggaaatt 
atatgcatta aggtttggga accactgaca 
gactgtgtaa gtttatattc atgttctgtg 
atttatcttt ctggatctca gtcttcttca 
catcacgttg ttttcagaat tgttgaacac 
acggtggctc acgcctgtaa ttccagcact 
ggtcaggagc tctagagcag cctggccaac 
aaaaaattag cctggcattg tggcagcagc 
cgggagaatc gcttgaacct aggaagcaga 
actccagcca ggaaacaaga gtgaaactcc 
aaaagaaaaa gaaaagtggc atgctcatag 
ggatcttgtt ttcataatac aaattaatta 
tatttggaac cttttctttt aataataaag 
cacttttaaa aacgttgtgc attattttaa 
ggaaaactca tacaaatata gtaaattgca 
ttcaaagtag atgcaattag tgcttcaaat 
catgatttaa caaacaaatc agagaagaga 



tactttgttg ttgttgacat ttttgttgtt 141000 
catctgggat atttcccttg cccttctgac 141060 
ctttatctat tgcatatctg tctgcttaat 141120 
atttagggct gactaatgta aaaatactct 141180 
gagttttcaa atttattata ttggctcttc 141240 
gcttttataa atttctgctt tggagcatac 141300 
attagctggc tctaatccca aatgaatatt 141360 
atttattaga caattagaat atggcaaatg 141420 
tttaatggtt tgtactcact cccctccccc 141480 
atcttagtac atatttatgc agattacttt 141540 
ccaggtggtt ataaaactgc tatttcattt 141600 
aaattattct aaagtcttta gaaaccaaaa 141660 
ataaatctag aagatggaga tttttatatt 141720 
tgcctctgaa tttagtctct gatttttatc 141780 
tggaaaccaa gagccatcat aaagcaagaa 141840 
acgtagacca tgaaataaaa aatatt.ttgt 141900 
aacagcagac caaaaataac atttttaaga 141960 
ctgccttata aatatggaat gttggaccca 142020 
cttttaggta aaattttatt gaaaattaaa 142080 
cctacggctc aatggatttc aagtgaatat 142140 
cacattgata gagcattacc cgacccgggc 142200 
tattctgcct tttaattatt acatattagt 142260 
atgcttgtaa gttccatttt agaagatctt 142320 
agtctttaac caggagatat caagatggaa 142380 
ttgccgtgac agcatccttt gtgattactg 142440 
atgttgaaag tacagattgt gctttgatag 142500 
ctgcactatt tttgcccaat attggattta 142560 
actaacatat aacaaaaaaa tggaattaga 142620 
aaaggaattg tgacatttaa gaattacatg 142680 
gaaagaaatt aattcagtca gttcaggcat 142740 
tctctataat catgttccac agtttaaagt 142800 
tttcgggaga gtataatggc ctttaatcac 142 8 60 
caggaaatat tgttactttt gttataaaat 142 920 
agtttgtcaa tatggtatat aacctccaag 142980 
ccctcgtctt cgccatgtgt tattgattca 143040 
aatagcctga gaagaaaaat agcttctaac 143100 
ctatgtttat attatctcaa ttaatcccac 143160 
aattgtgcaa aatggagttt caggaagata 143220 
catccacaga aaataattga aacacaaagc 143280 
cttttatata gaatctaaaa tgttatgtta 143340 
cttttccaaa acagaacaat attttagtgt 143400 
gattcaggtg atgatatctt tgattatcct 143460 
ggtctcaaag catagtacct ggacaagcag 143520 
cacacattca tgagccctac cttaaatctc 143580 
ctattttaat aggttctcga gatgattctt 143640 
tatttcttaa aagcacgaat tgcggaaaca 143700 
aaatactagc tatgtgacat tgaacaatta 143760 
tttatgaaac ttatacaata atcatatctg 143820 
acatagaaaa gtgtcatgct agggccaggc 143880 
ttgggaggcc aaggcgggtg catcacctga 143940 
acggtgaaac tccatctcta gtaaaaatac 144000 
ctgtaatccc agctactcgg gaggctgagg 144060 
ggttgcagtg agctgagatc gtgccactgc 14 4120 
atctcaaaaa aaaaaaaaaa aaaaagaaag 144180 
caagtactat gttagtgata gtaaatgcat 144240 
atgatataaa gcattcatta aatatctcaa 144300 
tctagtaaat gttgttattg aatagaaata 144360 
aaatatatga atcagaaaag gaaaggtgat 144420 
ctgtttttgt gaagatgatt ttatgcagaa 144480 
tagaaactgc aaatgatgaa ttcacaagca 144540 
aaaagtttct gatgaatttg agtttctaca 144 600 
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tgtggatttc atcttgtgat aagccattct caggagaaag catgagatgt tattcgtcat 144 660 
tccagaaagt gggagaacat aaaagttgag aacagttatt gcataagcaa gaattatcaa 144720 
tgaacacaat taatttaata catacacagg catttgactg tatgagaaaa agcatttcat 144780 
gttatttgct ttgtagatac aatatcagtt tcgatgagat tcatgctgca cggatggtta 144840 
cacctgtcac ctgtgggtgg gtaatcttaa tacctgtgat gacaagtaga ccacggggga 144 900 
ggactcgcaa aggatttgag tgaggagcca ctgatgctgt gtaaatgcca gagccgtgac 144 960 
cgtgcacttt tggggggatg* tgcttaaata tcttcagatt tcacagaagt tagggcatca 145020 
acggaaaata cttttgtttt ttctagctca tctgcacaag ttgaacacac attgttagat 145080 
agtgtgtatg tcagagcata tgtgaacaaa tgaaggcgaa aagtcaatag ccatacattt 145140 
gtatttgtat tgctctttat caatctaaaa aggtacatta aacaataaaa tgacaattca 145200 
atttttaaca gcattctaga ctctagaagg tgttttctaa tccactctct taagccattc 145260 
tcacaacagt acaagggaga atttcaatga gaccaggtca cccagcaata tggtgacaca 145320 
gatgagacac ctcctcagat ttttgtttca aaatttttgt tctgcctact atgccataca 145380 
tgctgtaatc atttaattag aatttatatt ttaataaatg aaaactaaag agtttttgca 145.4 40 
attgtgttaa cactcacgtg ttaaatttgt aattatgtcg gtaaataaaa agagtttaaa 145500 
tccacacccc attagccagg cgtggtggtg cacatctgtc tgtaatcg'aa gttttagaaa 145560 
actgaagtgg gaggatcact gagcccagaa gaggtggaaa ctgaagtgag ctatgattgc 145 620 
atcactgcac tctagcctgg gagacagagc acgacccttt gtccaaaaaa agaggaaaaa 145 680 
aaaaaaaaag aaaacaaaaa ggcgcaacac agagagaagt gatcttatcc ttatccttat 145740 
gctagatata tatttttttt ctggtcagag aattacaagt ctgtctaggt aggctcacat 145800 
ttgagaagag aaaatcctgt tctattgact tagagagtta aagaccctgc cataagcaga 145860 
gagggcacac gtgggactct gcgccaggac aggcacaact gtgggggacg cttca.tagca 145 920 
gcttttgcag atctcctctg cacctgaagc gccgcctgta gaagctagat gaaacctcac 145 980 
aatgccaaag ggttgctggg cttaggaagc acgggctgca tagggataac tagatcttta 146040 
tggatcctga agccactgcc ctttgtctgg atatctgtct ctttgtgtgg ccttataggc 146100 
tgcatttttc ttgcctgtat gcctgaggtc ttgatttagt cacactggca gggtacaaac 146160 
tggcctgact ctgccagtga cttcacagtg cagttgcaat gtcatttagt taccttccct 146220 
tggaggaact gctactgttg atgaagagtg ctttggacca aatattttat gggcccaaag 146280 
ttcatgtatt aaagccctaa ttcctaatgt gatgatgtga ggaagtggga cctctgggag 14 634 0 
gtaaataagg tcatgagggt- agagccctta tgaatgagac tagtgtccat gtaagaagag 1464 00 
atggaagata tgatctttct ctccaccatt tgaggacaca gcaagaaggc agccttctgc 1464 60 
aagtcaggaa gagagccttc accaagaacc ctctctgtca gcactttgaa ctcagactcc 146520 
ccagtctcca gagttatgaa aaataaattc ctgcatttta ggccacccag tctaagacag 146580 
agaggtaaaa tatatagtgc ttgggccact cgtttgttat taaattcact atgtattcag 146640 
cttcacttat atccttagaa aagtttgtga atgcagttga atgggtggag aacggtagtt 146700 
gcagataatc agagaagaag agtagagaca gggtgcaggg gaaagtaggt cctttaaatt 14 67 60 
gaaaaatgat aacatgatcc attcatataa atgcactttt gttcaattct gctccagctg 146820 
tattatccag cttctctaaa gaatggtgcc tgtacaggac agggtgatct ttaccctgat 14 6880 
gaaagatggc tctccaggca gatagcagga gagacattcc taattgattc caggagggaa 146940 
tgccaccacc atcaccacta tcatttatgt tgatatcaaa actgaaatgt caatgatcat 147 000 
ttctgttctc taacatttcc aatcagataa tttaaacatt ttactctaag agtttttcaa 147060 
ctcaaatgtt atgcttcctg ttttgtgcat ttgtattatt aactctagtt tcatgggagt 147120 
cccagttttt atacatcact gtatagcttt gatgttaaca tgattttgag ttaacataat 147180 
ttttgccaaa gtgagtatta acctagtttg atcacacaat ttggttattc tttattctga 147240 
tgctaaggga atccaatgat ttttgttgat agtaacccat caattctatt gactgtaaag 147 300 
agtaatctag aagaaattta aagaaaattt aatcagattt tacgttttct gtcgctaatt 147 360 
tggcatctta atagctgttt attttattcg aggtgcatcc tgagtcttgt aatgttcaca 147 420 
ttgcaatatt aaagcataac agatgagcct aaaatgttca gtggttaatt tgtacagatc 147 480 
aagatgttgt aattgcaata accttaatgt ggctaaccct tgaataacat aggtggaaac 147 540 
tatgtggata aatccactta tctgcagatt tgctgccacc tcggttaccc cagacacaga 147 600 
aaaaccaaac cctcctcttc ttcccccttc tcagcctact cagtatgaag acgatgagga 147660 
tgaaaacctt tatgatgatc cacttccact tcatgaatag taaatatatt ttctcttccc 147720 
tatggatttc ttaatagcat tttcttttct gcagctttat tgtaagaata cagaatataa 147 780 
atcatacaac atacaaaata tgtgttaact tactgcttgt tatctgtgag gttttcattc 147840 
aacagtaggt tgtcagtagt caagcttttg aggagtctaa gttacataaa aattgctgac 147 900 
tgcgtggctg gggctggggg agttggcagc cataacctcc acattgttca atggtcaact 147 960 
gtataatttt ttagcggaag tgtaggaatg tttagtatga tacaaccatg gagaaatatt 148020 
ttgtataaat atgtgtaaaa ttaagactgt agaactaaaa ataatacact tagttcatta 148080 
gttattaata gctaatattg atttagctct gtgataataa attgacatgc attacctcat 14814 0 
ttgtcacatt gtcatgagat caggattttt atttgtcttt catttacaaa tgaggaaact 148 200 
aagtctaaga ggagttaagt gatttaccca gtattacatg gcaaaaaaaa agtcagagtt 14 8260 
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taaacttaaa gccgagtctt tgcaattcta 
aacttattag tcaacctgaa ttctgaggat 
gaagcattta tattatgagt ttctaaaatt 
aaatcatggc atttagaggc aaacatattc 
ttgtgaacct gacgagtacc a.tttagtaac 
tttcatagag gtactgggga aagttaagcg 
tttcattttt tccttgtgtg- tatacttaag 
tccactgaaa acaaaacaaa acagaagatt 
ttgtctaact caagttgttc tgatatcatc 
aaagattgag ttagcaatgt tccctcaatt 
ttcagtggta ccacaattct tccatgctgt 
ttttctcttt tgcatcatta ttttacattt 
tcttataatc ccatgcttct aatttctagt 
tgtacaatcc aaataacact ttcttctctt 
aacactgtgt catatctgtg gtgtcaccta 
tgtagtgtga ttcttcttct tgtgtattaa 
cctccttgat atttttcagt tccaggcact 
accttgttaa cacctggaaa ttctccatct 
agtaaatggc at cat ct etc atagctttta 
tttgtttttt ttctgcctgc tgcttcttcc 
ttcactcaca gcattttcaa gaacatcccc 
tgtgaatatc ttaacaatta tacaatacac 
gacagctgtg taacagaatg ccctggtgca 
cagtttcagc tggcctcctc ccttaggtaa 
taatgttcaa gtatctggtc ttttctgeta 
ccatattaag tgtttttttc ctttaccaga 
ttttcacttt ttttacctgc caaacttttt 
ctttatgaga tttcagcttt ttctcttgga 
ataatttcaa atgtctgctt aaattttgaa 
tttcagacta ctaagcaata gaaatatgga 
tagtaagtat tcaagaatac . tgaatacttg 
ttcacaccgc tataaagaac ttcccgagac 
tgactcacag ttctgcatgg ctggggaggc 
gaagcaggca tgtcttacat ggcagcaggt 
agecacttat aaagctatca gatctcgtga 
gtggaaacca ctcccatgat ccaaacacct 
attacaattt ggattacaat ttgagatgag 
agggagtatt ataattggta aagaatgact 
ggaactaa.ee tatatttgtt taatgcaaat 
cattttttac ttatctttcc aagtctttgt 
agaagacaag aattaaaaaa gaagtagtaa 
agctgaggaa gtgagtagta atctaaaatc 
aaaagagaat gagcattggg tccaaaaaac 
cagttgeget gatcagtcta getttgeage 
ttttggccaa atactatagg gttgtctgtg 
agagtcaaga ctgttagcta tgttttgaga 
ttgacagaag agaagtgetc tacatgtgcc 
gttaggtaat ttttagtaat ateattagee 
ggtctctttc cctgcaatct cagacagaat 
gtatggacaa aatcaattgt actgagtata 
ctctaattgt acacagcata aaatgttgac 
agaattcagc ttatcttatg gtcagatatt 
tacaaagact tggatgacat cacaaagtgt 
accttctttt gaaacaatag aggcegggea 
tgggaggeca aggcgggtgg atcacgaggt 
gtgaaatctt gtctcttcta aaaatacaaa 
agtcccagct acteggaggg gtgaggtagg 
gcagtgagtg gagattgtgc cactgcactc 
aaaaaaagaa aaaaggaaaa ataggagtac 
taacgttagt atttcttcaa ttaaaatact 
agaaaaatac ttttggtttt attaaatacg 



aagtccatgt tcttagctac tgcctttctg 148320 
ctcacaaaaa gttacgtgct aattattttg 148380 
acaaattctt ttatgtatga tagggaggta 148440 
tatttagttc ccatttgtgc tacttaacag 148500 
ctcaactgtt aaggtgtgta aacaagttca 148560 
agagaatata tgactaacaa aaaatatgat 148620 
gggaaattta ggactgecta cttgttatct 148680 
aaaaatacaa acatgttgcc cttgtttatt 148740 
catttctgtg taaccacaag cacagttggt 148800 
ccagattgcc ttctctacca tctaatttct 148860 
aatagaaact ttgaaattat attttgetat 148920 
tgttgtcaaa tatttatttt ctgeagaata 148980 
cagtgacttt cttgattctt attcctctaa 149040 
ctttcattta acaagtaata atttaaegtc 149100 
teagtatgea gaacaccaag ggattcccac 149160 
caccaatgct ttcttggcct ettaacgett 14 9220 
ctgccacaca ctctcatggc cacacacagg 149280 
ttgaactatg aaagattagt atetgeaate 14 9340 
ttccccaatg ccttctattc agecttgaac 14 9400 
ctcattaagc cagccacatc attcgatccc 149460 
tcttcctctt tttatcttca gtcactctgg 14 9520 
tttcagtgtt cctagactcc tgttgttaga 14 9580 
gactgggggt ccctggatac cgagggtctc 149640 
gagatagaaa caagaatctt agtattattc 14 9700 
tagttttccc ccataaatct tgatttttag 149760 
taaacaattt tgtctccaga ctttctgcca 14 9820 
gtgtctttac gtactcaatt taaaatcctc 149880 
ctgagattga gaacctctct ctgggctcca 14 9940 
gcataatttc attttatgag aatacataga 150000 
tacaagacat acacgggacc acttggggca 150060 
gtggatattt ggtgagtgta ttaggccgtt 150120 
tgcgtaattt ataaagaaaa gaggtttagt 150180 
ctcaggaaac ttagaataat ggtgaaaggg 150240 
gagacagaga gagagagaaa gaaaaaaaac 150300 
taactcactc actgtcacaa gagaagcaag 150360 
cccactgtgc cccttcacca acaggtgaag 150420 
atttgggtgg ggacacaacc aaaccacatc 150480 
gtccctagtg cattgaaatg taagtctcat 150540 
ttgaattcaa gatggcagac taaacatatg 150600 
taagttttaa aeggattata taaaagagca 150660 
gccacatggg aagtatgeat gagagttaga 150720 
tcagagaggt tttgggttca taggtgtgca 150780 
aaggegaata atggaaggtg aatagttgaa 150840 
tcattgtcaa agaagagatc aagttggaat 150900 
atatttaata gtaactgagt gagcagectg 150960 
ccccaaagta aaccccttgc attttatcag 151020 
ctgacttaag cttatttttc attegtaagt 151080 
ttctatatgt agagttatgt gagaattaaa 151140 
tctactgtac atgeaaacat gtattatcaa 151200 
atgaagtcca ggttctctgc cttttatgtc 151260 
teaagecaga gcacaaaata ctcaaaggac 151320 
tgcactaaaa taattgaatc agtgattgga 151380 
tcaggctctt gaacttccta ggtccttata 15144 0 
tggtggctca tgcctgtaat cccagcactt 151500 
ctagagatag agactatcct ggccaacatg 151560 
aattagctgg gcatggtggt gtgtgcctgt 151620 
agagtcgett gaacctggga ggcgggggtt 151680 
caacctggtg acagagegag actccaaaaa 151740 
atatggaatt ctaggcatct cacaaatctt 151800 
ttttaaaaat tctaaccgct agaccactag 151860 
ttatttaacc aacctctggt ctcatcagag 151920 
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agaaaaatag atggcatgtt tttacctttc 
ctagtcaacc atttccaggg tgcacattct 
aacctacaaa atttattttt ttaagatcaa 
ttatactgtt aaagcaaact aaatatggcc 
accttgtgga tgaaccgcaa cctaaattaa 
gagttagcac ttataacaat agctgggtct 
ctcatacacc gctgagtattr caaactgtgt 
gtccagctgt ttctgtacct cacttctgat 
taaatttgtt ctgaccacaa ggcatccctg 
ggctgcccaa ttcatgaatc gttcattgct 
aaattttcct tttaacaata ccatctctcc 
gattcagaga gattgttttc .caaaagatag 
ggcagagaaa ctggtatagc tgtacttcta 
agcaggctct tatgcaatgg aagaggacaa 
tacaaatatc aatctgtctc cctttgcatg 
catttccttg caatatcagc tctctgttgg 
ttccactatt tttcccccaa attattaagg 
ttaagtggaa gtggaacttt cttgctataa 
tccaaaggag tctgtatctg caaaagttct 
aaaaaagaaa aaatccttgt ttaactaaat 
ttataattat aagaactaag gaaattgtct 
gagaatatat ttcagcaaaa caaggaaaaa 
acagtgcctc ttaccccaga gataagtgac 
tagaagccga atgtataaaa tatatcatgt 
ggtcattctg attcttgacc ttgatctgag 
atgacatact agccacgcct ttattttcat 
atatatacaa ttattttatc ttcctaacca 
aacaataatg tagataaagt atcttatgaa 
gtgtgtgagt ttgttcatat ggtcttatag 
ggttgtacac agaacattta aaaaagttct 
aatgtttctg cattttataa tcattaaaaa 
ttaaaatttt tatattagga tatcgtaggt 
atctaatttg attactatgt aggttatgca 
tttaaagtca taatgatcat cagatgaaat 
taagtagtga ataaacgaga caagagtaat 
tcaatggata actgcgtggc tccggttctg 
agtaatatga ggaaaatgaa ccacattttt 
acactttact attaaacttt ttttcaaata 
gaggacccat tccagtttaa aggagactca 
tgtaacataa gctatcaaag acattaactt 
tagatcaaat aaaggttttc tatcaacatt 
caaagaggat gcccttgtag ttagtactaa 
cacttggcag tgagtgcgtg ggcgtacgag 
atgcaacagt cagtacatcc aggaaaagga 
gtacaagcat gcacaccaca cagtgcacac 
aaaaggattt aggggagctc tgagtgtgtg 
acggagatga tgcaacagtc agtacatccg 
attgttgcaa tttttctgta a'gcttgaaat 
cccaggaagg attggcacaa cattatttta 
ttgaaagtta ctggaacacg ctgattacac 
taaacttaga cgtaagtgta agaacccagt 
gttactacac agtttccttt taaatggaaa 
gttggttttg aactacttgt agcccttcct 
tttcatgact ccttcaaata cggaaactat 
cacataagtt attagattct tttaaaatta 
gtctcactct gtaaccaaga ctggagtgct 
aattcctggg ctcaagccac cttcttgcct 
tgagctacca tgctgagcat acaaattatt 
aacagaagtg atcattataa gttaaatatt 
aaaggattga aacaataaaa tattattggc 
tatagactgt catctaaaca ctcaaataga 



cttgtttcag gcttctgatt agaattccta 151980 
taatatttct caaaatttat attaggctga 152040 
agaatgtcaa atattagcaa gtcacatggt 152100 
tgagaaagac tttgtacttc tgtatttgag 152160 
taggcagaga aaattttaaa cttaacttag 152220 
tggtaaatcc cagcagccat acttcagcca 152280 
tcaaataagg caaatgccaa tctgtaacca 152340 
tcctgtatgt cacgtcactt tatttgtcta 152400 
gagtctctct gaatctgctg tgattctggg 152460 
ccattaaact cctttaaatt agtttggctg 152520 
tgggagagtc aaactcatat ggcaggtgag 152580 
aaagtaaagt tgccattttc ttaaggtcta 152640 
ccatattcta tcagttacac actgtcatgg 152700 
aggtggcctc atcttccctc catgctctgt 152760 
ccctgtcttt ccttggattt taggctattt 152820 
attcaagaat agagatgatt ttgtagtatg 15288 0 
tattagtaat gctctttcat gttccatatc 152940 
cttttgaatt agcaattcta gactctagac 153000 
gagggaaaat aattgtgaag ttaaaattct 153060 
tatcattcta aaataacaaa aaatgatatt 153120 
ttatataaat gtctggagaa atagatttct 153180 
ataacacggg gatatactgg gactgaagta 15324 0 
atgaggtccc agaatcattt taagcaggct 153300 
aaaatataac gtgacacaga gggcacttag 153360 
tttcaatgac gaaaatagat tggccaaaga 153420 
tacatacact ttttactcca tactgcacag 153480 
ggcaataagt tacttgccca tcttcttatt 153540 
agtagagaaa tgtagtcagt tatttgataa 153600 
gaaataacaa acataattta aattggtaca 153660 
acctaatctt aaaaccatgg atttttactt 153720 
caatggaaat tttgttgagc ttaatatgaa 153780 
tttgccaata gaattctaaa acttaaatat 153840 
ctgtgactca ttagaatata tgatgtgttg 153900 
atggccatct ttaaagttat ctcatagatc 153960 
gtaaaaatgg ttgacattat caatgaaggg 154020 
atactcagaa ggaaaccaca tcacttaagt 154080 
aggaaacatc acacaaactg taaaccaaga 154140 
tgccaatgcc atgaaaaaaa acaaagagct 15 4200 
agagacatta cgaccgaatc ctctatgtga 154260 
ggtcaattca taaaactgaa atctggaagg 154320 
ctgtttcctg aaacggtagc ttgtggttaa 154380 
gggttgtagg gtccgtgtgt cagtgacttg 15444 0 
catgcacacc acacagtgca cacaggggtg 15 4500 
tttatggcag ctccgagtga gtgcgtgggt 154560 
gagggtgacg taacagtcag tacacccagg 154 620 
gatgtacagg cttgcacaca cacagcacac 15 4 680 
ggaaaagcac ttatgggagc tctgtgtact 15474 0 
tgtcctcaaa taaaagtgaa aaaatgttac 154800 
gatgaatact tcgaaaactt cctttatttt 154860 
atttgggatt tzaacccctct gccttaagta 154920 
ttaagttatc ttgagaagca caaaatttga 154980 
tgaaatacat ttcaagtcat actaatatat 15504 0 
tagaaaacta aaacaaaaac tctcttttaa 155100 
gaagatttat gtcttatgaa gtttatttta 155160 
ttattattat tatttatttt tagagacaaa 155220 
gtggcaaaat catatctcac tgcagcttca 15528 0 
cagcctcctg agttgctggg attacaggca, 15534 0 
tggtctgatg aaataattat tgataaaatc 155400 
tacagtgtga tttagagaac atacctgtag 155460 
ttggaatttg tatagaattt tgcccttcaa 155520 
agacattcaa atttacaaaa tggtgcttta 155580 
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ttggccagcc tagataattt ttcttgcttc 
cttaatatca cagacaactg aaattcatct 
gattttattt tctgatgcta ccacatgaaa 
aattggtgtc tttgtatctt gctaattaag 
ttgcacttaa tgagctcact tgttgtatta 
agttgaattt cttcaggagt ggctttggaa 
aaatgtgtac attacaagac -aaaaaaatca 
tgtagactca tactgacatt ttaaataatg 
gattgattca aaagctcagg caatcacaat 
atggtacaac atgatgctca taccagaaac 
tagttaagca atgtgcccct tggactctaa 
tctaaaattt ttttaagcca gtggccaggc 
accagaattt atttgacaaa aagtggaaaa 
tttaatagta gaattctgtc gttcatcagc 
gagaacatcc agatgttatt gattccaatt 
ttagctttcc aacttgtcca atatcatcag 
ttttgtcttt ggttgagtaa tggattccac 
taggaaatct aagcatttga tgtcaaaaat 
aggtaaagca aacattcaaa attcaaaata 
acatatccaa tcgacattga aattgtatat 
aagtcataat aatatgcaat atttcaggta 
taccttgcct gccccagtgt tagtaacatt 
atatgaatcc ttaaaaaatg caataactac 
ctgagggata gcactaatat ttccaggttg 
tggaaacagg attgtaggga gactgtgagc 
atcttttgat gaaatgcatt ggggtaaaac 
ttagaaaaga taatttttct cctacatatt 
agccaactaa acctctatag ttattggtfct 
ttatgactgt taagagcata aaggggaaga 
agacagcaga gctgtgagaa gactgtttgc 
cctttgaagc agagaattac aacaggaaat, 
agtacaaaat tgtgtcagca caccaaaata 
tcacaggtgg tataataaat gaaatcagca 
ttttggctca gccgaccatt agttatttgt 
aatatttatg tacagctcca gaatccacct 
gctgtggggg gcatggaaaa tatttactct 
aatgtgctct tggcacggga ggtcctggaa 
ctttatcatt ttcttttttc cttcaactga 
tagtaagtgg tgatactgcc agactcatcc 
ataatatttt ttattagctt acttgtagag 
ctaaacattc ttttgccttt! gtagaggaat 
aaacagtaaa attaaccagg tattgaattc 
ttttacaaaa aaaaatactc aaggagtaaa 
cacctacgtt catctttctc tggatgttag 
agtgtcaaag atttaatgct gcaagagaaa 
tgtcttctta tttatcaaat aatatct'agc 
aagaaggaat tatagcggtt tgccacatta 
gtagagaatc ttcttaaaat tttactccac 
gagatggttt ttccaggctt agattagttt 
gtgacacagc agttaaatta catcatgagt 
aggaaaactg aaattctgca aaatgaccta 
tcaagccgtc cccttattag aaggtaaact 
■ tgctcagaaa tgattaaatc caacaggctt 
gggagttatc ataaagcaca gatactgggc 
ccagtgattt tgaaagtaat tccctgtgat 
ctctcaattt taaaaagaag tttttttttt 
aacatgcttt agtgatggag aagaatctgc 
atcatgtttc ttattgtttt cctgtataaa 
aaatcattta aataccctta aactgtttac 
ggtaatgtta tttaaattaa acgaggtact 
aggccttaaa aatagctatc tcatattttt 



ctcagccccc tttttactta ctcctgaggg 155640 
gcactctact tactatgtgt aaccagctgt 155700 
aaaaaatttt tttcctgagc tatagatgtt 155760 
ttctagcaac accttctccc tcttcctcgt 155820 
atttataaga taattagata aaacaaaaaa 155880 
agtgcccctt aaatattcaa acatggataa 155940 
ctttaagaag aaatatactt aatgttagcc 156000 
tgatgatcag ggcttataca aaaaggaaga 1560 60 
tattttggtg taagcaaaac aggatcacat 156120 
ctttgcaata tttaagcaca acaaacaaat 156180 
ttaccatttt tatttgcatg ttcatttttt 156240 
agtattaaat tctttgctgt ttatcatgat 156300 
atcattgacg tacagtgaaa ctgatcattg 156360 
agttagggaa ttgtggaaat tgttttgagt 156420 
ttctggtcca actcaaactc atttgggcca 1564 80 
aaaaggcaat ggcatctctt actgctctat 156540 
ttgactggca tatatattct ggctgccctt 156600 
acacatcttt taaaatactt. cttaatattg 156660 
tcaagctgta aatgttaata ctaaacagga 156720 
aatgaacaag aaacaataaa ttaaagtgga 156780 
ttctttgata gatgtaatga cagaagctac 156840 
gagctttcca gctaggatta gagcattttt 156900 
tcctccagga gaggatgctg tccctgcttc 1569 60 
tctgctttac atcaagactt tgctttagga 157020 
tcaacagtaa gattagtttg tctccatttg 157080 
ctcttcccag tgtcattagt agctcatgaa 157140 
caagaagaat cagaaccagg taattgacat 157200 
tgttctgact caaatgctaa cttgaatccc 1572 60 
gaatgcccag ggaggcaact ggagagacag 157320 
agaaaatatt cttcccatgt gtggacttgg 157380 
taggaatagg ctggtgtagc ctcataaaca 1574 40 
tttcactaga ctgcccttcc cctaggaggt 157500 
atccttctac atgtctcgct tatatggtta 1575 60 
aagctggggc ataaatacag acactagttg 157 620 
tcactaattt gtctacttat ggggaatata 157 680 
gggagccctt aaatgtaatg catgaatatg 157740 
ctcttggtag gtgttgggtc cctactgacg 157 800 
atataaaact gaaaagccgc ttttaacagt 157860 
tataagaagt tagctaaatt ttatccagac 157 920 
tccctcctca gagcccaact aaacgtggtg 157 980 
aacattctga gactcctttg ttttttcacc 158040 
atactctcat taaagtcatt gttaccattc 158100 
gaaacataaa aattgacact aaaacaatat 158160 
atttggaatg tatgcataga aaatggtctt 158220 
tcatctccaa aagaatatct cctttaattc 158280 
gaccgcactt ttcttttgtt ccttttgtat 158340 
gagtacttta aaagtgcttc ctaaaatttg 158400 
cctgcatctt tctggatatc tatcactcct 158460 
tctaaatctc tcagctttct tttatctcaa 158520 
ttagcctgga ctctcttgag tcccccaaga 158580 
tatttaaggg tgaaggaatt aagagacatc 158 64 0 
aggaatcata aaataatata tttgaccgag 158700 
gccaacagag ttgtaccgta gaattgccaa 1587 60 
tctgcctttg accttctggg tagataaatg 158820 
tctactgaat ctccacaaat cttgtgtaac 158880 
ttttcagttt aagccattta ttcaaagtca 158 940 
ttttaaatgt attaaaacaa attgatagga 159000 
gttagacaaa taggatattt ttgaagttat 159060 
aatatcgttt gttatatttt . tatcttctgg 159120 
gagtgcttgc tgaaactaac tagaaatctt 159180 
aaaagaaaca tcaataattt ttatatattt 159240 
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ttaaatttat gagatggaaa gttagctttt 
atgaagaaaa tgaagagttt agaaatattt 
aaaaaaaaac cctcatctat atgtgtgagc 
aatggccttg gaaaatttca tggaatgaaa 
gggtcttgtg gttgtggctc agatcatggg 
agtagcaatc cagtcacaag atagtaccaa 
tttcttaaaa gaaagagttg- -ttaggaatga 
cagatacgat actattctga aaatggtgtg 
tgtgctagaa atctgcatcg ggaagaaata 
agactgtgat tcaaataaat aatgtgttta 
ccaacatttt gggaggctaa gccaggtgga 
tggccaacat gatgaaatgc catctctact 
gggcacctgc aatctcagat actggggaag 
aagtggagat tgcagtgatc tgagatagca 
agactccatc tcaaaaataa ataaataaat 
aaaaataaac ctgaagttag gcagtgcaga 
ttttcctgtg tttttgaggg tttctcctgg 
gcccttcaga agtcgggaca tcatcacatc 
atagaggaat gagccatata caacatacat 
aagctaattt agtaactaat attgtagttt 
ttgatagaat gcagagggca ctttaaatca 
tatatacctt gatttattag agatagatta 
cttgcagtga taagttattc tttttcaaat 
ttcaaaaatt tcacggcttm gaacaacaaa 
ggctgagttt tggctactct tgacttggct 
caagctggct gagcttggct catcacgttt 
gaagcccagg cagtggcatg ttctttaaag 
cttaactaca taagcwcact tgatgaagtc 
caaccatcaa gccatggtaa agctgtcgat 
atcattcagt ctttcacaaa acactggcaa 
tacaaaaacc tacttaaaac ttaagggtgt 
gaatttaagt tataaggact ttcactgcag 
tttggtcatt gggaaagtgt ttttcaggta 
aaaaccaatg acttccaggg taggctggaa 
acttggaatt gagatagaga cgctaaactg 
tcatggaata cactcattaa ttatcacaga 
tttgatgtat aattggttct taacttctct 
ttatcttcag gtattataag gcttttaaaa 
tttgtttgtt tatttgtttg ttttgtcttg 
ccaggctgga gtgaagtggc ttcatcatag 
gccatccttc tgcctcagtc tcccaagtag 
tttattttat ttttttagac atggggtctg 
tggcttcaaa ctcctggact caagtaatcc 
gacctttcaa agtgctgaga ttacaagcat 
tatggggttt tctatgcttt cacatataat 
atgtttgggt tacttttagt aaaccgtatg 
atttattttc aactgctact tcccttctct 
tatgttgtgt cacagagtct gaattgacac 
cggttggctg acatagagcc ccactcactg 
acaaaccaac cctaggcatt agtatcaaat 
ctgttgactt ttatgaaact gagacaaaaa 
tgtaaaaatg acatgaattg taagtttcct 
aaaactatat aaaattataa tttttttctt 
tttacctcac ttttagaatc tcgaattcta 
caccttttaa acatttgatt cttataactt 
atagaatatg gatgctccag cagctggttc 
aattatatgg ttaaaatgtg aagcctgagt 
aatgcagttt ctaaagtttt caagaagtgt 
ctgatacaat gtcattacaa ctgagaattt 
atgcaaccct agtgacattc tcattacaca 
aacgtttgga aagaacaatg tttaaatatg 



actaatctta tcaggctaaa ctgagtaata 159300 
tttggaatat cagtgaaatc tctaaaaaga 159360 
gacataaagt cacactttgt tgaaatgcaa 159420 
ctaaatggca gcacatcact tttccagatg 159480 
ggtgatggca aactcttcca aaggaaaaat 159540 
acatagactc acagatgagt aatatgtcag 159600 
gagtaaggat gtgcctcgat tacaataccc 159660 
ggggttttgt gagggatgcc aaccacacat 159720 
ttttttgcta catgttacaa atatgcctat 159780 
ggccaggcgt ggtggctcac gcctgtaatc 159840 
taacctgagg tcaggagttc gagaccagct 159900 
aaaaatacaa aattagctgg gtgtggcgac 159960 
cttgaagcag gagaatcgct ggaacctggc 160020 
ccactacact ccagcctggg tgacaggatg 160080 
aaaaataatg tttttataat tttttcatag 160140 
cttgaatttc tcattctcaa ttcccattca 160200 
aaagtaaaac aattcaatgc tttatcttct 160260 
attttacctt taagatattg gactggattc 160320 
aggagtatga agaaagggag acagccttca 160380 
aatgtaatga gaaataaaac tagaagatga 160440 
gagagacgat ttgaaagaaa gaaatatttt 160500 
aaaagttcgt ttaaaaaacg tgatcaagaa 160560 
atcatgacga taatgctgca taatcaccca 160620 
tattactctg gagctctctg ttctgtggtt 160 680 
gagctgggag gatggatctg cttcaggctg 16074 0 
cattacgagg cacagactga agtgtttgga 160800 
gcactgagag gagcttatga atgaatgaag 1608 60 
aaaatgcaag gactgggaag tacatgccac 160 920 
gtactacggg ggagtggagt gctggtacga 160980 
aatgcaggtt attcagacag aatcgttaaa 16104 0 
attagctatc tgtctccaaa tgtacttttt 161100 
aaagtatgca ataatgaaga agataattgc 161160 
gaagtgaatg cagaaacaac cattccaagc 161220 
agagaattct cttaatagaa aagaaagagc 161280 
agctactcct gttggggctt tgctgatcct 16134 0 
agccacctat gtactctgca ttcattggga 161400 
atgtttagca ttttgcttag aagttagcac 161460 
tgacattttc ccctctagag gaagtatgcc 161520 
tttcttagag, acagggtctt gctctgttgc 161580 
ctcactgtag ccttgaactc ctggactcaa 16164 0 
ctggggctac atgcttggct aattttttat 161700 
gctatgttgc caaggctgat ctcaaactcc 161760 
tcccacctga aggtzgggagg ccagcacctt 161820 
gagccgctgg ttacaagaca tggccagaag 161880 
acatacttag gttttctcga tgatgaaatt 161940 
tgctttgaat taataaaaaa gttgtgattc 162000 
tttggaggtg ctccgttaaa agattccctt 162060 
aaaagcattt ccaggcatta ttcttgttcc 162120 
atggcactgg ggaaggacag agcctgttgg 162180 
ggacaaaagg atcctaagac gtttagacaa 162240 
tggtaagaaa tttagataaa tatttccttt 162300 
acactgtaaa ttagaaaatt taaattatat 162360 
aatattggca tttttatgtg tactaaggta 162420 
ttgggagtgt attattttca actcataaat 162480 
ttccttctca ggaaaactga acctatattt 162540 
aaaaaatgat tccgtagtaa taatcacatc 162600 
caccgaaaag ttaggtcatt taatttgttc 162660 
attagattta aaagagactt gaaagggaat 162720 
tctcaccaga attcttgaaa gtactgaagg 162780 
tgtagatcag aaaccaagaa gtctatgagt 162840 
aaaaatgatg tgactcatga ttgaaaaaga 162 900 
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aagtcatgta aattggaagt gcaaaaactt 
gtagtaaaat aaaaatgaaa gtaaatatgg 
atagataaaa ccaatgagaa tgggcagcac 
attttgatga attacttggg aaatctggta 
agttaataat tccactttaa aaaattcaca 
tgcacagctt ctatattaat gtgacactgc 
gatttgttca gttgttccta- gagggtgtgg 
ctcatttgtt ttagttttta tcttttaaat 
ttttcttctg agatgccaat ctaaatctgt 
ctgatttagg caattaccat ggttagtaaa 
ttttgtggac atgatcaggt catctttttg 
accactatta ctctgattac ttttatactg 
caatgaaaac cagaattcag aggaaatgta 
tcttcattca gtgccatgct cacctggaag 
aacagcagtc atattccttc agagaatttt 
gacagcacaa gtcaatccta atgtctttgg 
aacaagacaa acatttttgc caatgtcatc 
atttcaatgc agatagtttc tgtccagctg 
tgtagggctg gctctcctca taaacttgag 
gttcttcctt ggcatccact gaaaaaagag 
tgggtattca gtcgcatgtg cattcttaaa 
ttaactggct cacctaggat cctatacttg 
aagaaaggga ggattggttc ctctacctca 
tgggaatgca tgttgagagg caaacagtaa 
gccctcaata ttttcttttc tttgccatat 
cattaaatgc attgcccttg gataaaagcc 
tttgaacctt tagccatatt catgtctacc 
ttccatcaga gtaaaatact tcgcaagagc 
cagatttata ggatgtggtt ttctcatccc 
tgcagatgat ggagaaagtc ttcaaagaga 
taaaacagca tagatgttca gagaactgca 
agagagaaga ggcaatgatt ggtagaaatt 
aaaagctgaa gatgtatttt atcacgaata 
gcagcatttt taccttattt atatgagaat 
gggaacagga agaatatatt attgccttta 
gcctttagtg aggcaatgct gaggctatta 
tgaagaaggc agaagtaggt ttcttgaggt 
aggctcttac gactgggaga taattaaatt 
catgacctgg ttttctgact tagtgtcata 
atcgtacctg aaggaaaatg cttgtagagg 
atgtgtacga gacatttcgg tgggcaggtc 
agccctgtag aagaatctag gcattcagtg 
cagtcaaaac aataagagtg gctgagatca 
gcaaatagaa gacatttgga tgctgcccaa 
ctgtcaagga gctgaagaat gaaaggtcag 
gtgatcagcg gtaaaattga atgaacagac 
gagcttcaaa ttagaaattt tttaagaggc 
ttttgaaaga tgatttattg attctttaat 
aggtctccaa gtctcctcta ttttcacaga 
acctattcaa ggttacacat ttggctgaca 
tatttcagtt catctgaaat aaatatcctc 
gagcacatac acctctttac atgaagtcct 
gtatcacttc ttcagacatc cctctcttca 
tatcataaat acctatcatt tctctcttgt 
tcatttctac ctggccctaa aacctacaca 
gctccatcct tgtattgaac acaatgctgt 
tgccatctaa atccttcctg agcccacagt 
tcagcatttg acgtgctttt tctgtagggt 
tccaagctct cctttcctgt gtattcagaa 
tcaatactat cagctagtag ggcatacaac 
tttacatcct cccttctcct tccactttat 



tacaggagaa atctatgaga ccaaaaatag 162960 
ttgcctaaaa cacattaaca aatgaaacaa 163020 
taagagtaac agagaagttt gacccaggca 163080 
ataatcaggt aatatgggag cagcatgatt 163140 
cagaaaagga cagagtatat atggaagaca 163200 
tctgagtaat taaacaccgt ttccaggaaa 163260 
atacatccag gacattcagc ttcatgactt 163320 
tgggggaaaa tgatgtgtta ttcactgatc 163380 
tatcttatta aacaataagc agattttgtt 1634 40 
caagaaaatt accctaacaa tggaattagg 163500 
aggtatctgc cttttcctgc attcataata 163560 
tccccataag tagtggagtt ctcacttacg 163620 
accttctgaa ttcttattgt gcaaatttcc 163680 
aaataatttg tcaaatattc ttgccaacca 163740 
cacactaaaa acaactatta gttttacgct 163800 
atcatgaatt tatctacaga tttcagaaga 1638 60 
catcaatcaa gtggcaaagc agatgtgtgg 163920 
tctgctctcc atcttggatg ttggttttgt 163980 
atggtcgcta gcaccttgta agggtatgca 164 040 
tggatatctt tttcttagga aattcatagt 164100 
caatcactgt tgccaataga aaacaatata 164160 
tagacagttt cagccaaata caaagatctt 164220 
agaaaaatta aatcaatgtt gatgcaagag 164 280 
acacaacaaa atgataaatc agatctgctg 164340 
cagaaaatct tctatgattc agagctcact 164 4 00 
attggtaaaa taaccaacaa gtcatcctcc 164 4 60 
tacattatgt atatttcttt gcttcccaca 164520 
cagccatgtt ctaggacata tggacatgaa 164 580 
aactcttgaa ggatgaatgg ggatcatcca 164 640 
agtgaagatc aattctaaca agacgataac 164700 
agaagcctca ttgcttaacc caaaagnatg 164 7 60 
agatcttcaa ggatcttgta ttagcacaga 164 820 
gagtaccaac aaaataattt tcatagatga 164880 
tccttctggc cagagtgtgg catatcactt 164 940 
gtgaggcaat gctgaggcta ttatattatt 165000 
ccaaagtcca ggctagaaat gaaacccaaa 165060 
gaaaacagag tcaggtcctc cttaatacat 165120 
tagtggatgg ggaaatgaga agtgggggag 165180 
gtgtgtggaa atgcattaag agaaacagaa 165240 
aaaagagtca ttttaaatat gataaatttg 165300 
taacaggtaa cttacaatac atgtgtgcag 165360 
tctgctccag ggtcatcagt atgcacgtgg 165420 
tccagggaga gtgttttgaa tgagaagtaa 165480 
tatttaatga atgaattaaa gatacagagc 165540 
agaaatagga gcatatttga ggagaagaga 165600 
accaagaaag ccaaagccag ccatgtagaa 165660 
attcttaaat gaagactgtc ctttattttg 165720 
cactcaatat tgcaactggg atttcttcag 165780 
tcaaaattct caggtgcaga gtatgtcagg 1658 40 
actgtgtcaa gactaaaatt tcatgatcgt 165900 
caaacataaa ccataaactt tatgttttaa 165960 
ccagctttgt aactcctcca ttcttatcag 166020 
gtcattcctc tcttttcttc tcagcctgta 166080 
tgtataattg actccatcat cttttgttct 166140 
tcagatcaat cctatgttct ctctttttag 166200 
aattatgtga attggtctaa tttgaattct 166260 
gctgtagaga aaatatgttt ctattctggg 166320 
tatacattcc ctgattatta aaagtttatt 166380 
tatacctcac atgctgcttc acagagacag 1664 40 
tgtggccctg tcatgcctaa atcatatcaa 166500 
tccccttttc tttgtacaat gtcaatctgt 166560 
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ttcaaccttg gctactgctt cagacatctt 
atctccaaca tcttacctct gccaaatctt 
tatttttaga tgatgtaaat ttctataatt 
tcctactcct cctcttatag tcgtgtattt 
gttgccagca gaaaacaata tattaacagg 
acagaatatc cttgatactc tgtgtatgtg 
tgtgtgtgtg tgtgtgtgtg- tgtgtgaaga 
gaaggtcatc aaaagttaga aatttaaatg 
gcacctgcca aagtgttttc tcatagttca 
tagtccacta cacagtgttc acctttaatg 
actgatatta acagagtgtt atacagcagg 
taagttagtc atataataat atacttaaag 
catgccatca taattgttgt acaatcctga 
aaaaatcttt actgagtgaa ttctatctga 
ttggctaaac cctattagtg atgtgggttt 
taagatgtgc tatatattac agatggactt 
ttaaccacaa gcctccaaat gaaattttta 
aacacagcgg tatgaaataa tgcagtatca 
ttctatatta aaagatagat atgtaacaaa 
ttaaagtata acaataataa aactaaaaaa 
atcttattaa tgatttcaca atgatgccta 
atgaataaat atgggatttc ttttcaaaaa 
cagaaataaa tacggggtaa agtttaataa 
gcgttggtca gaatgaaact ttcacagagg 
gaaataattg atatcagtgt ctctaaagta 
gaggagcctc tgaaatcctg agaatggatt 
gccagctggc atgcatgggt ggttggcgag 
tttgtctcat gtactagttt aatgtgccgg 
gcctgaatca gcagttctta gcaactacat 
actgtgactc caagatttcc ctgaataagt 
ttccagggtg tatattctga atatattctt 
acatagacct gatgacattc agtattcgag 
taaagagcaa aaatatgttt taattatttt 
tagagtttcc tggaaactca agtcacggct 
atgaaccaat atagaactac actgctttat 
tgatacagct ttttaagtgg tttttaaatt 
taaaggaatg aactaggata gctattcagg 
ctcattatgt catgtctaca ttcttgctgt 
catgatttaa gcactcataa aatattacta 
aattagccct cctctaatag cccaatagaa 
ccccagatat tgaracctca atccgagttt 
tctctattgg agatgctaat agattgtaaa 
taaatggttt gagagtctta cttttattct 
tgcatctttc gtctgcactt ctgetctcca 
atctttcaca cttttattct acatctaaaa 
aattctcatt tgtaatattt atcaaaataa 
ttttaataca ggtttctaac atgagactga 
tctgaaaaac tgaaatacaa tatatacgca 
agtatttgat tgcgtaacag gtgacaacag 
aactgaaaga gtataattgg attgtttgta 
ttgtgatgat ggatatatgc ccccatacaa 
attttcacag gcagaggacg gtggatcact 
aacatggtga aacccgtctg tactaaaaat 
cctgtaatcc cagctgttcg ggaggctgag 
agattgcagt aggtggatat tgagccactg 
caccttaaaa atatatgtat acaaatatat 
tatgtatgta tttatacata taaatacata 
caacatgtct tcaaactgaa taattttagc 
agatttaaaa tttcttcctg ggaagtaaat 
aaatcaaaac ccatgttctc cataaattaa 
cggtcaggtg tctttgggga gtttagaatt 



gcattgtcag tcattatcct tctcttgtgg 166620 
ttcccttcat cagtaaccat aatcagactg 166680 
ttatgttagc tctggcttat ccctctttct 166740 
aggtcacaag tacactctta aacaataact 166800 
caattatcaa gtatgaactc cgaatttcca 166860 
tgcttgtgtg tgcgtgcgtc tctctctgtg 166920 
caagtttatg acaagaagga gagactcaca 166980 
agaatacgcg tgaaagtaat tttctagcct 167040 
tgtgaagaaa taccaatagg ttgttgtcgt 167100 
taggtggcat gggcctaaac aaggatggaa 167160 
tgctgtacta agggctttat gaggaagggt 167220 
ctttaatttc atggtgggat agttcactct 167280 
attaaagaac ttttcaagat gttattaatg 167340 
ccctcccagt caaaaggctt ctactcaaat 167400 
ctgactgctg ccgtaacatt tatttggaga 167460 
tgggaagtga ctgagggtga cagaaattaa 167520 
tgctggtgag taagaaatac aaatttctga 167580 
atggttaagg aagcttccta ggttttgaag 167640 
cctgcacgtt gtgcacatgt accctaaaac 167700 
aaaaacaaaa aaactttgat aagagagtac 167760 
atataattta gtgtcttggt aatttaggta 167820 
atataagtaa tttttaatga aaataatatg 167880 
ttaacaaata ttccagaagg ccagtagtct 167940 
ggagagtaaa agtgaagcag ataaagtaat 168000 
agaccgagga atttgaattt tgtaggcagt 168060 
caacataagc aaaatcgtat tttggaagca 168120 
aagtctccag tgagcaggtc attgcaggtg 168180 
caggggggtt gctgtgaggg aatttaagaa 168240 
gaggggatta agacatgtaa aagaaagtca 168300 
tatatcttat tccaacgcta ttttgaatat 168360 
gatatattca agattcaacc tgagtaagtt 168420 
gtcagataac aatgtaactg aattctcctg 168480 
acaagtactc aacatattaa ggaacttatc 168540 
gttgttgatt ttattaatta ggaagtttac 168600 
atgtataata ggtcctattt ttgaaatggc 168660 
gagacaactt gacatctagc ttttaatatt 168720 
tcatttgaat attttgtgca aattctcaca 168780 
ttcatataac tcaaccctcc ttttgaatat 16884 0 
aagaagcctt gaggctctaa gtaacgcaat 168900 
tacataaata attttc.tccc actatttcct 168960 
taaaaactga cctatactag gtactagaga 169020 
ctgcatctag aaaacattaa accagtattc 16908 0 
gggtcagcca actttagatt cttttgtgaa 169140 
aataatcccc tgtaacagtt atttttttta 169200 
ccttgacagg atttgttaat ttatcttaaa 169260 
attagtttcg agtaactata tcttgatgtt 169320 
aatctaaaat at tat cat ca tccaaaataa 169380 
catataatat agttacagtg aataaaatct 16944 0 
tcgacagtaa tatattgtac attttaatag 169500 
atacaaagca aaaataaatg tttgtgatgt 169560 
agtgctgtgc ctcatgcgtg ttatcccagc 169620 
tgaggtcagg agctcgagac cagtttggcc 169680 
acaaaaatta gcagggcatg gtggcaggca 16974 0 
gcaggagaat cacttgaacc caggaggtgg 169800 
cactccagcc tgggcgacag agcgggactc 169860 
atacatatat attatacaca catataaatg 169920 
tatttacaca cacagagggt cattatgtaa 169980 
catcttgtca tataaagcct ctaacaccac 17 004 0 
attaattaag gaaatcccaa cttcttttac 170100 
catcataaaa cattttttgt tcatgtcaag 170160 
acctgatacc tctgttcttt tatatcaact 17 0220 
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ccataaggaa 
atttaaaaga 
atatatttcc 
gcgtatttct 
ttttaccctt 
gaaaaaagta 
tccttggcca 
ttcctatagg 
agatatctgt 
tgcattattt 
caaattttac 
tatttgttta 
aactttctga 
ctaaaaataa 
tattgatttg 
tttattaaag 
gatcatggtg 
cagaaagaaa 
tggactgtag 
tgttaataat 
ggtgtattag 
agaaaggaag 
gcctcaggaa 
tgggagcacg 
ctctatcaca 
caatcacttc 
gtccctacac 
acaaggctgg 
tggatcagga 
ctaaaaatac 
ggctgaggga 
actactgcat 
aaaaaaannn 
tggaccatat 
acaatataaa 
ggcccaatgg 
ccttactgtt 
ttcttacctc 
tgacagcttg 
ttgggagtac 
cctctgttcg 
gcatcttcat 
aaataaacaa 
ttggtgcagc 
cttccatatc 
aaatgccatg 
tttttaaatt 
atgtctcctt 
ca.tgagagaa 
tcggttttca 
tcccagagtg 
atttatcttt 
ctatcttagc 
tcctatatcc 
aagatagtct 
gattgtaact 
caactgctat 
ggctattaac 
tgtattattt 
attaatactg 
cttcaacggc 



atcccactgc 
agggggaagt 
atgggtttac 
cactcccatg 
ttggatccag 
ggagtcatat 
atgcaggtca" 
acaggggagc 
taaatctatg 
gtaatgttaa 
ctgttatctt 
tttatttttt 
gttttaaagt 
ataaatttgg 
ggtgttttag 
aatcttttgt 
tcatggtgtc 
acaaggagct 
taacacatct 
gtaaccactt 
tcctttctca 
tttaattggc 
acttatgatc 
aagggtgcgt 
agacaacact 
ccaccaggcc 
acagccaaac 
gcatggtggc 
ggtcaggagt 
aaaattagcc 
agagaatcac 
tccagcctag 
gagcaaaacc 
ggactctatc 
aatgagtggg 
tggccggtag 
tcttcaaaca 
tgtttcatgc 
gggcaagagt 
atttccgttt 
gtgaaactat 
cgttaagaat 
gaattttttt 
cctttggcaa 
attttggctt 
gacaatgctt 
ctttattcaa 
aggtatcctt 
gctgggaagg 
ttaggaagaa 
acctgtgcag 
cgacttcttt 
atgacacrta 
atctcctaag 
gaaagtaatg 
ataaaaatga 
actcaatcaa 
gtattttcat 
"ctcactctgt 
ctatcagaac 
ttgtcataat 



attacttggt 
ttaagtaata 
acaaagtatc 
aaaagttctc 
gtggaaggag 
gatggttctt 
Catggtcaag 
aaatactttc 
tttcttaaac 
tacataaaac 
ttctctatta 
agttattgaa 
catggaggaa 
tgttaaatcc 
gagttttagg 
tagaacttcc 
cttactaatc 
tccaacttcc 
accccgtcca 
ctctcatttt 
tactgttata 
ttacgtttct 
atggcagaag 
gaggtgccac 
aaggagatgg 
tcatctccaa 
catatcatat 
tcacgcctgt 
tcaagatcag 
aggcatggtg 
ttgaacctgg 
gtggcagagt 
ttcccctgat 
acaattactc 
cataactttt 
attgctgact 
atctgcttaa 
ttgaaaactc 
ggctttctga 
ggctctcctc 
ccacccttgg 
gcatttaact 
cagatttcaa 
tgtggccaga 
ttgttctcat 
gtgtgtctaa 
tgaatcagaa 
ggcctgaatt 
ttcatatttc 
agatgagtta 
gatatttctc 
ttctgttaaa 
aggtgctgat 
gggtttcacc 
ctgtttcttt 
taatcaagaa 
ctgccaagtg 
acccactgtc 
ctattgaaga 
actctctctg 
taacagcagg 



atgtgtttct 
tcaggattaa 
ttcatctgtc 
cactggtcca 
tcccctcttt 
aaagtaatag 
cctgatgtca 
aatctgtcat 
acacatgact 
atgaaggaga 
tcaaaatgct 
taatgataat 
aaactataat 
aagcatattg 
agcatagaaa 
cagacagcat 
acttttaata 
catgtacccc 
acacatctac 
tctgcagttt 
aattaatacc 
gcaggctgta 
gtgaagggaa 
acacatttaa 
tgctaaccat 
cactggggat 
gtcaatgttg 
aatcccagca 
cctgatcaac 
gcacgtacct 
gaggcggaga 
gagactctgt 
gctctatatc 
aactatgtga 
ttcaataaaa 
cctctgctat 
aaaaagtttc 
tgcaatttga 
ctgagtctga 
actttagact 
ttctcttcca 
gtaggagatt 
acctaacatc 
ggtctaagca 
gtttgttgcc 
gacaaaaaaa 
aggaaaagtg 
atgccagacc 
tagccagggg 
aatggatatt 
ctcccataac 
ctgcatgcaa 
gctcaaattt 
ttctggtttc 
aaccggctat 
actgaaactc 
ccatttacaa 
attctcatga 
ttacattctc 
aaatagaaac 
taactcatac 



gtaggcctct 
tgaaataatt 
agtgactctt 
Ctgactgtga 
gtgacactga 
acatcctttc 
tcagggtggg 
aaaacctgaa 
gtctacacag 
gtgtgtctcr 
ggtagtcttc 
ggtgtctatt 
gaaataaaat 
ttattaacag 
tatatcatac 
ttttggagat 
ggatatttaa 
atcacaaatc 
cccatcctct 
ttctccattc 
tgaaactggg 
caggaagcat 
agcagacatg 
acaaccaggt 
tagaaaccac 
tacaattcaa 
ttctctccct 
ctttgggagg 
atggtgaaac 
gtaatcccag 
ttgcagtgag 
ctttaaaaaa 
catacatgtt 
ttgcaatgaa 
ctttattcac 
agtctttctg 
tatttctcat 
cctttatacc 
agcattatct 
tcttacccca 
gtatgtgagg 
caaaaacgca 
tggagttaag 
ttttcctatt 
ttatgggaac 
aaaggacaaa 
tccctaagac 
tgaactttgt 
cactgcctgc 
ggctaaacaa 
tcggtgttag 
ttcctgtgca 
ttctaactca 
cccacgtgta 
ttctcataca 
aacttctcct 
agtttcttaa 
ctagtttctt 
acctggttat 
catttttcta 
agtccctaaa 



ttagtgtaaa 
ttcaagtaaa 
aacagcagat 
tctttgtgtc 
agtttcccta 
tgctcacatt 
aggtgtgatc 
tgaaatagta 
ggttcctttg 
ttctcatgat 
taatgatctt 
tcagatctac 
ataattttat 
gtaatacttt 
ctgtgtttac 
ggtaacctga 
accccctttc 
atccttcctc 
gaggtcattg 
ctgtaagtgt 
taattcataa 
ggttggggcg 
tcttacatag 
ctcctgagaa 
ccccatgatc 
cgtgagatct 
gaaaacaaga 
ccaaggcagg 
cccgtctcta 
ctactcagga 
ctgagatcac 
aaaaaaaaaa 
tttggctctg 
aacagctatg 
agaaacaggt 
ttaacctacc 
ggtttccaga 
caaagttttc 
gtgaattatc 
tggatactct 
tttcttctca 
acaataacca 
gagttcaagg 
tctgctccgt 
ctcaggtttg 
attagtttgt 
cctggcagcc 
cagttcagtg 
ttctccaaat 
ttgacatgtc 
tgttctagga 
gtctcattca 
gattttagta 
tttcaaataa 
tctctacctt 
ctcactcctt 
catctcttga 
ggttaaaacc 
tctgcttccc 
ggttaaagcc 
caatcttatt 



170280 
170340 
170400 
170460 
170520 
170580 
170640 
170700 
170760 
170820 
170880 
170940 
171000 
171060 
171120 
171180 
171240 
171300 
171360 
171420 
171480 
171540 
171600 
171660 
171720 
171780 
171840 
171900 
171960 
172020 
172080 
172140 
172200 
172260 
172320 
172380 
172440 
172500 
172560 
172620 
172680 
172740 
172800 
172860 
172920 
172 980 
173040 
173100 
173160 
173220 
173280 
173340 
173400 
173460 
173520 
173580 
173640 
173700 
173760 
173820 
173880 
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tatcccaggg tattgctgag gacttcttaa 
gctttatttg gtttacaaag gatatttacc 
actgaaatca ttaccttcaa ataacctcca 
tttagaagat gttttgtcaa caactccatt 
gattttcttt ctcctctgct tcatcactaa 
atcctattgc ttcaattctc ttaatgcctc 
gattacagcc ttacttcaggr ctactgtaat 
aatcttttcc atctgttcgt taaaagaatt 
gctacgatta tattattcac ttcttcaaat 
gaagtccaaa cttcttacca tagtttagaa 
agataatgct gcaggcaggt tgtatctgtg 
ccacactgag gatgctgggc actgcatggc 
tgtaactgta attggaagat tacatgtcct 
cttctgatct caaccatgat gttcttccta 
ggccagtctg tgcccttcca tgcatggatg 
tctcatccca tcatccccca tcgctgcact 
agcagattca. gtttgggtct tgaggccttt 
ctgcttttgt tacactgctt gaaggattgt 
ctgagaaaga gctttaaggg ccgttgcatg 
gagaacgacg tttcaaatag aagctgctct 
aaccatattt aacccagagg tgaaatgcaa 
ctgagatttg gggactattc cacagcacac 
ccatcatatt tgaagatttg aattcaactt 
atgctagcag agaatgctac cttcttccca 
ttctcctcag atgtaaatta tgagcttgga 
aatgcatgca taggtctctt caaaaaataa 
aatgctactt atcactccat ttaccatgtc 
ttaaaaaaat agttttgttc tctccattcc 
ctttttcatt ttgtatttca tcctggcaat 
gtatgtgtgt ccctaaaggc aacagatacc 
tggatattca actttctgga ccattccaga 
agtaactaca attgcatcat tttgctcttg 
accctaaaat gttgctcctt cacaaattaa 
ggtgatctca agatacatct caaacttaac 
ctttacaaat aaaaatctgt cactacaagt 
ttgaatttta gaatatgcat tctcttgtgt 
ttagtgtaat atttttaaga tttgtccaca 
agtcataatt tgtatacaga tataccaact 
tattttgtgg acttatgtat ttatttttct 
gtcatgtggt atgcataaat tgaaatacga 
tagggtctga gattttaacc tacttataag 
atgcaagcag aagacacaaa acttctgggt 
gagcaatcag catgagcatc atgtttgact 
gatgacacag agggcccaga tgggtggctg 
ggaactcttc attttaagat aactggtgag 
tttcctcgtc cctcaaagtg gctcattgca 
aaaataataa aaagtaggag agtttgatac 
cagaaaacgt gtggtattag gagagaccca 
tttacattcc cattagcaac ataggaaaat 
tattaacttg aatttccttg aagaataatg 
tgagcattca tatgtccttt tttgcaatgt 
ggttacatct tcttattttt cagttttaag 
ttgtcagata tgtgcactgc aaatattttt 
cttagagaaa ccttttgaag aacaaaattt 
tagcacttag tgcttttgta ttctaagaaa 
tatcttatcc tttcttttac aagtttataa 
ttttgagtta tttttgtgta tggtgtgagg 
aacttgttct accaacattt atttaaaaga 
ttatgtttgt cctttctccc actgaattct 
tgtgcatttc cagttttggg ctctcaataa 
cttgagtgct gaatatttaa attaagtact 



tctttcactc agttcagact cactcttaca 173940 
tgatatctac caggtatctc aagctgcaag 174000 
tatctatact ttgtccttcg tgttcccttg 174060 
cctcctagaa gttttgtagt aatcatccct 174120 
ccagcctcct tttttctgat cagccacaac 174180 
tcaaatatac tcactccgct ctagtgtggt 174240 
tacttgccta aaatatgacc agattctttt 174300 
tattctagag gttttatttt ctacaatgga 174 360 
aattcaatat cttaccatta ccctctaaat 174420 
agcttttcca gagctggaca catctatcat 174 480 
aacctgtcat cacctccctc tttgtactta 174540 
ttcccctatt accatcatca tggtcctcac 174 600 
aatcctccac cctttcatca catcatgggc 174 660 
tgcctggctc tgtgtttggc aaatggtaag 174720 
actgattggg atgtcacctt tgctcctatc 174780 
tgactcacta gcgttttctc ttccctaaac 174840 
gcactttgtc ttctctctgt gtgcaagcct 174 900 
cacctttcaa aacaggagac ttccgcatgt 174 960 
gtttggctgt ctcccttttc ccttttgcct 175020 
ttcaatctgg tgaaacggca tgaggagaaa 175080 
tgtcagtgag aaaaaagcta ccgtgagcca 17514 0 
tccaacctag ggaaatctga caaaaacaga 175200 
ctctctggtc tcttagcctt ctatcaatgt 175260 
cagtaagaaa agaagaagaa atatcctacc 175320 
tttccttcct gtataagttc tatcagcttc 175380 
aataaataaa atagctaaag atccttccta 17544 0 
cctagtgcta aacaactgtt tataatcagt 175500 
cattttgtga gctccctcca gcttctcaac 175560 
gcatgtaaac accctttaat gttattagtc 175620 
ctttccgttc ccaatttatc agaagactca 175680 
ttttataaaa tgtgaccatc atttgacttc 17574 0 
ttgctttgcc tctgcttcag attgatctct 175800' 
catataactg aatatgcaac attttcactt 175860 
atccccaaaa caaaatatca taatttttgc 175920 
tacgtttcag ttttagaaaa tcataagaaa 175980 
cgggctttga aagacacaat gttttgtgtt 17 604 0 
ttgcatcaga tttgcccttt tttattgctg 17 6100 
gaatccagtt atataaaatt catctacaat 176160 
tagggaaatg tttaacagtg gggttttctg 176220 
aatgcagaac tttctctaag cgactattgt 176280 
ctaacatgtt agtttattaa tgccttatgg 17 634 0 
cagagaaaca ggactttatt atccacagta 176400 
tggctttttg tatcgtccac gtcccacagg 1764 60 
aagattacag gagaggaaga gtgatcttgg 17 6520 
caaacctaaa ctttgtcttg gataaaaaca 176580 
atcataaccc tgagaaatgg cccaggttaa 176640 ^ 
tagatacttt tctagatatt ttgaagtatc 17 6700 
tggaggacga tttcacaaca gctgagtaat 17 6760 
tatttaattt cagtaattct agtaggtgta 17 6820 
gtactgaaca tcttttcttt tcttgcttat 176880 
gtatgttcag aattgtttct ttttaaattg 176940 
ggttctttac atattattaa tacactttct 177000 
ctctcagtct gtggcttaca ttttcatggt 177060 
tatcttggtg aaatgtaatc ttttattttt 177120 
tgtttcccca -ccccacaagt ctgaaatttt 177180 
tatgattttt gcttttttgt ctatatttag 177240 
tggtggtatt taccttttcc aggtggaatc 177300 
ttgtcctatt cctgttcttt tctgattgaa 177360 
gatatttctc aaaaatcaaa taataatatt 177420 
tgttctttct ttatgccaat atcacactgt 177480 
ctagccagag gtatttgatg ggttccaatc 177540 
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cattacattt attattatta ttataactat 
ggtaggctag tcctgtagca accattttgc 
tacaatttga tatttttctt cctttagtaa 
tctcccattg aaatagggaa gcaaattcac 
cttct'tgtcc aacagaaggt gggacaaaaa 
ctaggtcata tgaatccttg aagcttcttt 
acccgagatg ctatgtaagt* ctaacaacca 
aggtgttctg atggacagtt cctgcctagt 
aagaaagtga gaatagccct cttggatctt 
tgtgaattct agaaatacca tgttgaaagg 
ataagtggga taccagaacg gctgttagca 
attccagtgg aagttttcag tgtctctttt 
tcccagcaga ggtgctcagc atactccaaa 
ggttcgaatt cacaaatttc catttataat 
tctgtcattc agctttgtca tctctccata 
ctctgctgcc taaacactgt attaatgtta 
aaaataggga tagtatcacc ttcctctgga 
cgtaaaaaat ttgatgttag gcacagagga 
atgaggactt gatgaaaaaa aggtaaaagc 
atgccagttt ccttcattct tttcctttta 
caacacagcc tcaagaagct catgattcac 
gcttgagttg tagttgtttt acaaaatgta 
caaaacaatt ttgctcgtag agattttaaa 
caattgcaaa gaagttagca tcaccttttt 
ctaactacag aataaaccat gggcggtaca 
ttgtgcagtc cattgatcag gcaggcagtg 
acgttttgct acaaagaacc tagtctaatc 
ctatatttgc aaatcatcat ttttttttct 
tcctatgcag aaaggttttt ttatttattt 
ctaccttgca tagaaacacc aaactataga 
tatggcaatt ttgataccgc tacctacaat 
catggtatca atgcacagca cttagagcag 
cacgcagtct gcatatcagt caatctgctt 
agactcacct gaaatgcaaa gagagaggtg 
acctacactg cagttcttgt tgatgaccat 
agagagtaaa ggtgaagcca gtccctatgt 
gtaaactttg attttgttta agaaccaaag 
taattcatct ccattttaat gccacccttg 
gaatgggttc ttacgttctg gacactgtgt 
gtgagtttca agccaatctc taagtcaaga 
atatactccc tggagcctca ggctgttgcc 
atggaaagga ccaggttatc tctgatccat 
gtacctttcc tggacttgag aacttatagc 
aaagttttag ttgctatgag actggcatat 
tggaaaatta tcagaaatgg gtataaagat 
gaaattagaa aatgcccaga aagacatcaa 
taaatgctgt taaaaaaggc aaagagtcat 
aaactggaca caatgggaca gggcatggaa 
aagctagtat caaaggagat gctgtgaatt 
gctgccccag gcaaattgtg acatatggtc 
gaaggttaat aacgtaggaa atgaattgaa 
gaacctgatg caatgtggta ggcagaattt 
gctttgtgat tatgttatat gcatggcaga 
ttcacctgac tttaacagag ggaggtgatc 
acacgagtct gtactagcag aagcagaagg 
gatatttgaa gcctgacaag gacttgacat 
cataaggaag ggggtacagg tggcctctaa 
ttgacctgga gaagaatgtg tgttaaacaa 
attttgctta tatgct'gaga agacatcatg 
aaggagcaaa gtgttgaggt agaggtgaga 
tccttagtca ggcacagtgg ctatacattc 



aataataatt attaatggtt gtttacagaa 177600 
cataactaat agtgtgatat tctgtttact 177660 
attggagatt gccagaaatt ccttactatt 177720 
ctcaccttga atgtgggctc attttattga 177780 
aaaaaacatt atgttattgg acttccaaga 1778 40 
ccagctctct tggaacactt tctcagtgag 177900 
cctcactgct gtactgggga agacacatgg 177960 
ccagaattcc agatttcctg ccaagatgct 178020 
ccagaccagt tcatctgccg ttagtgttag 178080 
aaagctccat gggtaaatac gcttctgcta 178140 
gctgtcaggt ggaagtggtt ctcgtgtcag 178200 
ccttataggc ttaggatgtt tctctactgc 178260 
gaactctgac tttgatatcg tgccaatcag 178320 
ctgtataaca ttaaatcctg ttccaatgtt 178380 
cagaaaaata cccacccaat gtcatcccac 1784 40 
agtggcttcc ccatgctatg cactgtctat 178500 
tgcattgtga aaagtcaatg tgaccatcca 178560 
tttgcttaat aaatgttagg tattattcac 178620 
atgtatttat cacataaaag ggtcaaataa 178680 
gggcactttg aaaatatgga aaactagtta 178740 
tggttactta tacaagtgat cctaattatg 178800 
aaaattattg agataataat tagttattta 1788 60 
ttctgtcatt aaaaagaccc acaagcctct 178920 
ttttttacca aattcgaaaa aagcaataat 178 980 
gtacaaaaaa tgttcttatg tttttaaaag 179040 
gtggggagaa caaagccaaa cactgaatac 179100 
agttccattt agaaatgcac ttt cat ttga 179160 
ctaaaaataa tatggagtgt ggaaatgctt 179220 
tgctgagttt caggaaatga agcttagttg 179280- 
atgcacaaat ttagcacaga gtctgccacg 179340 
tcacatttac tgcatccatc caattacacc 179400 
gctggcagtt cacaagagtg caacggatgt 1794 60 
cctgatatgc tttaagaaag cctgttcctc 179520 
gaggacgatg gcccatgttt cctcttgtgt 179580 
ctaatgcaaa ctgtcccaca aatctttcaa 179640 
taacaaacct caaccttatt tcagcagatg 1797 00 
cttcaacagc ttcaggaaaa catgtgtttg 1797 60 
catggacgag ctacatctcc tgctccaggt 179820 
cttcatttca gttattgata cctagagttg 179880 
gtttatttgg gagttgggtt tccttttgaa 179940 
tgttcacctg tttccttttc agaccaaatc 180000 
ttggggtgtg taaatatcaa gaaggataag 180060 
ctaataagag aagacaggaa taaaaaaaga 180120 
aaaagtacag ggacaggaca aagaaaaaaa 180180 
ggcttcatag aggaggtaaa ctctgtctga 180240 
agagaaggat aggctgaaag gtggaaggca 180300 
gaaatagaca atgaatttct agaaactaga 180360 
tacaattaag tgacctctga tttaatgttc 180420 
ctaatcttga aatagcacac ataaaccaga 180480 
tccctaggca taaagttaat ggtatagtca 180540 
atggctgctc tgacacacgc tacaaagttg 180600 
tgttcactat gactttcacc cccggtgcca 180660 
agggacttgc agttgtaatt aatgttgcta 180720 
ctgggtggtc caggtgggcc cagtgtaatc 180780 
cagacagatn agggaaagag gcaggacaga 180840 
ggtattaata gttttgaaga tggcggggaa 180900 
aagctgatga tgacctctgg ccaa.cagctg 180960 
gtgtggatgg tgatgataat aatgaagatg 181020 
ttaggagtga ttacagataa tgcagcaaaa 181080 
gaaggggtga gctcaagagg caggcaggaa 181140 
cttttcattt ctccttggac ttgcagatgc 181200 



84 



WO 01/40493 



PCT/IBOO/01507 



gaaacagatt ccttttctca gatgcccatt 
gccataaaga tcacatgtgg cattttaagg 
gaaaagctga agcaatagga ggagatgcac 
tatggaacaa cgagaaacat ccctgcagta 
ttgtgccctt tcttgacatt ttagactgtt 
acacatgtga gaagaaggtg ggtctagaag 
tgtttgttag tggggaccat" gaaggcgaga 
ggctcatgcc tgtaattcca gcactttggg 
ggagttcaag accagcctgg tctaacatgg 
attagccaag tgtggtgatg ggcgcctgta 
gaatctcttg aacctgggaa gcggaggttg 
aggctgggtg acagagagag actccatctc 
tccataaggt agtggaaatg ggatctgctg 
gaatgggata cccaaaagca gaggagctca 
gaaagcatgc taggggtagg attgcagaat 
gttcagcagg acttggacac tgggcatttg 
tagtgaacag atgctttgaa tgtgggagac 
gattaaggca ttagtaaata gcttcagtag 
tcccaagggt ttctagcctt catcctcagt 
taaggccatc cataatttca cacctatact 
gcataagctc ataataaatt acctgatcca 
acactcatca accacaaatt aagattaatt 
ctaccatcaa tggtatgaat gaccatgtta 
ctaccattcc ttgagtatgg atagtcaata 
aaagaatata ttctgtatgt ggagtccata 
gtaattagta tattctgatt tataatctac 
cagctcgcca ttaaaaaaaa aaaactgtta 
ggcattatgc agtgtgcttc attattatac 
gttaaaagct gtgtgtgtca ctcatacagg 
taaatacatg aattatttcc aaatgtgagt 
attagtgaat agaaataagt gatttatttt 
aaaggcatag agcagacaaa ggctaaattt 
attaaatttt ctcaacacaa ttttcctctt 
tggtctttct actaattaaa aacacctaaa 
tcaatgcacc agaaagaaag taagaattct 
tgagaagaaa gaagaattct gaaacaggtt 
acttgcaaat ttgtttgcac agcctctcag 
ctaaaaagat gtgagttgat cataagtact 
actcttagca tagctgtgag ccgaaactaa 
taaattgtcg ataaacttga ccctgaatgg 
aacatacaca caaacctccc atcaaccctt 
ccctccaatt tacacttaac gcatggtaga 
aaaacccaga gtttatttta taggcaattt 
ggagaatttc agcttcatta ctaaactcat 
ctcaccagca gatttcacta aagaatcaag 
acacaagagg catcagaaag aaggctgttt 
tggaatataa aatctatatt taatatcttt 
atcatggaaa aagaaaacat aagaataaac 
aactttcaga tataaaagaa ccagtggtcc 
gtgccttagt gaactgaggg acctgaataa 
gaaataagtg caatgaagga caacctagca 
atatctctca ctggagttct gaaaacagaa 
ggattattga tgactgatga atgatgacaa 
aaccaggata gttaaattaa aataaaacac 
ggagaacacc aaagtaaaaa gagcattgta 
tcccagagag tgacagttat atcaactgct 
gaaagtagag tgatgtttgt tctgtattgt 
actattgcct gtaaaactta caagaggccg 
actttgggag gccgaggcgg gcggatcacg 
cacagtgaaa ccccgtctct actaaaaaac 
cgcctgtagt cccagctacg cgggaggctg 



cattaggggt ggccatgtga ctgaattcca 181260 

cagaagcttt tcccttctcc cagcagaatg 181320 

catacaatcg aaggaagcag agaccctcaa 181380 

atagttgaac aaaaacacac tcccattgtg 181440 

cattttcaca gctggaccac tgcaaccatc 181500 

gaagaaagtg agggaagcta caaatattca 181560 

tagatttcaa tttcctcggc cgggcatggt 181620 

aggccaaggc gggcaaatca cctgaggtca 181680 

tgaaacccca tctcaactaa aaatacaaaa 181740 

accccagcta cttgggaggc tgaagcagga 181800 

cattgagcca agatcatgtc actacactcc 181860 

aagaaaaaaa aaaaaaagat ttctattttc 181920 

acctcctgag tgtgtaggag tggaggtgga 181980 

tggaggacaa gaactgtgag gactggaaga 182040 

agagctcaaa cttccaggtc aatgatgagg 182100 

cagtgaetcc atgtctcaat attgtctgta 182160 

tagaataaat gcatttttat aaagattttt 182220 

aactgtttac ttctttctac tgcaaagata 182280 

tgctttctta atagttacca tgaatatacg 182340 

tatcatctca aggcaagcct tgttagacag 1824 00 

gataaggctt ttttttttgt ttccccaatt 1824 60 

acatagatta cttggtctgt aaacatttaa 182520 

aaggaatttc ctaagtgcaa tagcagagta 182580 

ttttgtaaag ctataaatac gggaatctgt 182640 

ccaaatctgt tcaaaatgcc tatgttaaat 182700 

catataaact cattaaaaga aacccacaag 182760 

aattgatgga aatctggtac ttatttttta 182820 

acagattctt ctcactatag tatcgtttct 182880 

atacttacag agcgttttta aagcagtaca 182940 

taaattgcaa gtcacaattt agttaagtag 183000 

aatagccaaa aaacaagctt aaagcaggtg 183060 

tatggagcat ttgcacattt gcaagatttc 183120 

ctagtaatat gggagactag atattctaac 183180 

aagtcttggg aaatttgttt tataaatgaa 183240 

catttgaaaa ctaaatagag aaataaacct 183300 

tgaatttgag gatatcaccc aaaaatcttg 183360 

agtgcaggaa acaggaggac atccaggtcc 183420 

cttgatgagc tggaatgatg agaaggctac 183480 

agacaactcc tttctccagc tgatacaagg 183540 

aaggtaaaca cgtaaataca cagagataca 183600 

taataagaaa gaaaggaaaa agctagacct 183660 

tacagatata gacatggata tatatcacaa 183720 

acatacggca aaaagaatac aaatcttttt 183780 

ctctagagaa tgttactaga aaaatgagca 183840 

aaagaaagaa ccatgaggaa gaacttgcaa 183900 

agtaagaacc cgtaggaatg taatcacaca 183960 

aaaggaatga aagactagtt ttaaaatatg 184020 

atgatgattt ttaaaagaag aaccaagtag 184080 

tgggtatgac cacagcaagg acagagctga 184140 

attacccgga aggcagtgca gacaagcaag 184200 

caggtggaag aaagaatgtg aaggatgaac 184260 

gtggcaggaa aatataaaga ttgtaactga 184320 

ttaatacatt catgaaaaca tataaatact 184 380 

tcacacttta acacacctta tgataaaacg 184440 

aaaatagtca gcaagaaaag acagattaaa 184500 

aactttccaa gagcaaaagt gacattagca 1845 60 

tctgaaagaa aataactgcc aacctacaac 184 620 

ggcgcggcgg ctcacgcctg taatcccagc 18 4 680 

aggtcaggag atcgagacca tcctggctaa 184740 

acaaaaaaat tagccgggcg tggtggcagg 184 800 

aggcaggaga atggcgtgaa cccgggaggc 18 48 60 
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ggagcttgca gtgagccgag atcgcgccac 
tccgtctcna aaaaaaaaaa aaaaaaaaaa 
gattttgtcc aagtcaatcc tgattaaatt 
cttagagagg actaaaatgg aagtggggag 
ataaaaatca aaataataca aaaataatca 
gatagactaa attatgccat* atcgatacaa 
tcctaaattc cctgtattgt ttagggagaa 
ggtaagttaa ctgtaataaa aaaattcctt 
tataatgaaa atacaacata taattcccaa 
atcaaccatc tgaatctata agtaagaaca 
aactaatatt ctaaaaataa gtttagtatg 
aatactccag gtaagaaatc atcgatatac 
gcacacttaa ctaggacaac ctggtgacag 
gatataccct gttttataac aataagaaga 
catgcactgc caatgaaagt gtaaactgtg 
acctgaaaaa cgtaataccc ttaggtccta 
atagacatgc ttattattcc atgatttata 
tccaacaaca tgaaaatagt atttaaaaaa 
cataaggcat atgaatcaag aataaaattc 
cttggccaag agtatctcaa ggaaacccaa 
ggttaaacat gcctccttat acccttccct 
taatattaaa atagagatca gaagactgac 
aaattctaac ctgagtctgg tatagcatca 
atcaaggtat atatagtata tcaaatgtat 
aagctgtttt ttgtagagga aatttgcagc 
ttttctagag tctgacatcc tttaaggtat 
gaagcctgct acctggagac ttcatccaca 
ttatcttaac tcaattgttt ctttcttttc 
ttttttattt tttttttatt ccaataggtt 
aataagttct ttagaggtga tttctgagat 
actgtaccca gtgtgtagtc ttttatccct 
caaagtccaa tgtatcattc ttatgctttt 
gtgagaacat atgatatttg tttggttttc 
gtctcaaatt ccatccaagt tgttgtgaat 
tagtatatat gccacaaata tatacatata 
tatagtatat aaatatatct aaatatagta 
atactatata ttatatatac aaacatatat 
tattatatat atacacacat atatatatac 
tgatgggcat tcgggctggt tccatatttt 
gcatgtgcaa gtatcttttt tgtataatga 
tgggattgct ggatcaaatg gtagttctac 
ttgtactagt ttacattccc accaacactg 
tattatgttt caattttttg gttatggcca 
tgggtttgat ttgcagttcc ctgataatta 
gccattttta tattttcttt agagaattgt 
attgggggtc acatttcaac atgaaatttg 
caaactatat aagcaggatt atatagatac 
cattatttat atatttatat atttatatta 
tatataatat atatattata tatataataa 
tataataata ataaatatta tttatatata 
tataaatata aatatataaa taaagtatac 
caaatccata ggacacctat aaaattgaca 
agatttatgt tatgcagata actgcctctg 
acaaaaggac aactagttta cagatcaatg 
gaagaaccca aaatagatat tagaaaatat 
taagcctagt aacttacagt cagacagaaa 
tactaaaatg aataagcttg gcatgtattg 
ccaaagacag aagtaaatct catggtgaac 
gagggacctg gtgggaattg aatcatgggg 
tgagtgagtt cttgtgggat ctgatggttt 
gctgccatgt gaagaaggac atttttgttt 



tgcactccag cctgggcgac agagcgagac 184 920 
aaaaaactta caagacagga gaaacccaga 184980 
aaacccaaaa taggagtaga gaaataagaa 185040 
ttgagagcaa aaatctgttt acatgggtaa 185100 
tttatcttgg gataaaacaa aactaaacaa. 185160 
tatagcctgg ggaaaatgtg aagataaatt 185220 
ttaccagttt ttaaattatt tttaaactta 185280 
gggtaaccga gtagttttta aggaactgaa 185340 
aaaagtaatc acaaattaat gagaaaaata 185400 
tcatataaag gtgggataaa tagaaataaa 185460 
tgaataatta gaatatgtat aaattaaata 185520 
agcctccagg gtcttagaag gaaacaaaca 185580 
tacaatagag aaaatatttt taaaagtata 18 5640 
ttctcagggg agggtataca aaatatacca 18 5700 
taaactttct gaaaacaata caccccaaaa 18 57 60 
agaaaagtga cactaagaaa atatgtatct 185820 
attagcataa atgtagcaac tacttaaatt 185880 
ttcattaata gcatggggca atactcattt 185940 
aaagaactcc aactgattaa atagactccc 18 6000 
aaaactagtt cagaccacaa taggaagcgg 18 60 60 
ttggagttca ggcatacaac tgactgacat 18 6120 
aacacagract ttttgtaaca agaagatacc 18 6180 
catgacagat agcagcaggc tttgaaggaa 18 6240 
ttctttgaca tattttgaaa tgaccctgaa 186300 
ccgtaaataa tctccttccc ttactagata 18 6360 
gataggaaat acatatgatc tattgtctct 18 64 20 
taacaagaac cttggtttcc ataatccccc 18 64 80 
ttttcttttt tgaatttcag gctaagttta 186540 
tttggggaac agctggtgta tggttacatg 18 6600 
tttggtacac ctatcacctg agcagtgtac 18 6660 
caccaccccc tattctttcc cccaagtccc 186720 
gcatcctcat agcttagctc tgacacatga 18 6780. 
tatgactgaa ttacttcact tagaataata 18 68 40 
gccattattt cattcttttt tatggctgag 186900 
ctaaatatac atactatata tatatataaa 186960 
tataaatata taaatataat atataaatac 187020 
acacatatag tataatatat atactatata 187080 
acaccacatt ttctttatcc acttgttcat 187140 
tgcaattgtg aattgtgctg ctagaaacat 187200 
cttattttct tctgggtata tacatagtag 187260 
ttttagttgt gtaaggaatc tccataatgg 187320 
ttccctt.ttc actgcatcca cgccaacatc 187380 
ttcttgcagg agtgaggtgg tatcgcattg 187440 
gatgttgagc atttttccgt atgcttgtgg 187500 
ctatacctcc ccttaggctc cacttccaat 187560 
gagtttggaa acatgaagtt tggaaatatc 187620 
aataaagttc aatgtatttt tcaaggcact 187680 
tttatatata aaaataatat attatttata 187740 
tatatataat atataatata taatatataa 187800 
taaatttata tatataaatt taatatgcta 187860 
tccacaattg cctaatacag atttctttta 187920 
gatataaaat gtgtttccat ggtatttaaa 187980 
aaagcagaaa cattaatttg caaataaata 188040 
aatgccctct taaaaaagtc ttgggtcaaa 18 8100 
ttttaaaaaa tagataatga aaatatatat 188160 
tggatttaat gcagatatta gaaaagaaaa 188220 
aagaagttat aacaagacaa ggaagaaaac 188280 
tgtagttccc ataatcccca caactagtgg 188340 
gcaattacct ccttgctgtt ctcatgatag 1884 00 
tataaccttt gctctgcact tctctttgct 1884 60 
cctcctcctc catgatcgta agtttcctga 188520 
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ggcctcctca gtcatgctga actgtgagtc agttaaacct cagcctcagg aatgccttta 188580 
ctagcagcgt gcgaatgcac taatacatcc catatctcct taaaatgtac aaaaccaagc 188 640 
tttgcgctga ctaccttggg cacatgtcat caggacctcc tgaggctgtg tcatgggtac. 188700 
atccttaacc tttgcaaaat aaactttcta aattgattga gacttgtctc agatactttc 188760 
tgttttacaa attacctttt gttctagacc actcacaaga gtgaccatgc aaaatggaag 188 820 
tgtgccactt actcagctag- aaacgctcct tagcttttcc tctcactggg aatgaaagtc 188 880 
aatattctta ccttgacctg tcaggccctg tacttaagaa aagaacagca tgctttgacc 188940 
taaaattacc tctgaatggt ctttgtctgt aaaatggaag agacatggag aggcgccggc 189000 
acccgaggtg ctttctaaaa taaggggtgc cttaaaagtc gtgttgaagt ttgtcttcct 189060 
ggaataggaa gtgggtggaa tattttaatt agggaggtga tgtgattgaa tttgtttata 189120 
aaattatttt tggttaaatg taatagagaa ggaatttgat acagtgggta ataatgaaga 189180 
ggaaacaatt ttaggaatat taaaatatcc caggcttgaa tgaacgtaga gatggagaaa 189240 
tgtgtaaaga ctgaagaggc tgttgagagt tagatggaaa agacaagtgg acggattagt 189300 
gttgggataa gaaaggaaga aaggagtcat tgatggggcc atgtttcaga aggtttaggg 189360 
gtgaaggctg ctgcaattca ttgagaccag agcctttgaa agatagataa gcatgcttgg 189420 
gtgcagtggt tcacatgtga aatgtcctac gaaacataaa agtgaaagga ggagacaaat 189480 
ataataaaaa atatatattc ttgatgtcaa aactaaaata taaatatttt ggagactttt 189540 
gtgtgtattt ggatgctatt ctcatgctct tttttctgga aaaaataata caaaacaaaa 189600 
attatttcac gagttctcaa tttttgtcct tctgttttta tcagataaat gctaataact 189660 
ttcatatctt tcttctcctg cagtttccca tgtcttccaa atttatttca catcattatc 189720 
ttacttttcc tcaaggccgt tttttctctt gactacagca atgcagattt caattctgcc 189780 
attgaattcc cagacatatt cgtcatcccc attttcatcc cccaccaccc tgccattttc 189840 
ttcgtgttaa cttgttttcc tgactcacag aaatcacctt ttcctgtata catttttagg 189900 
atgtcagact ttattctaat gatttctcct agttgccccc caaaattgta ttctacrgtg 189960 
tgattttaaa gctgaatttt caagatgata tttcatatct atattttcac aagcttttct 190020 
tctatgaatg ttattgtcag ctgtcagggt gtgagatggt acttgatact acattctttc 190080 
caagctgttg cctgaatcgg tttaagacaa agtcattact aggctgtaaa ctgttgctct 190140 
gcaaaattga gcagcacgta tttaaccact catacttctt agctctccaa cactttgagt 190200 
caatagagta tacaaacacc tgtaatttcc agtgagcaca gctcccaggc tctctctctg 190260 
tctctctctc tctcttttgt tttttttttt ttgcctcagc ctctcgggct ttatttgtgg 190320 
gatttgcacc ttcaccttct gcatgcagtg tgccatggct agttcccctt acatttttgt 190380 
catgattttg gtagaggcaa tgatttcctg gttgtagaaa aaagggcagc aacataggat 190440 
tacaaaagta ttgctaagaa attgcaacaa gctacttgca gtttatgtat gattactcac 190500 
agtttttgga tggtgcaacc cttaatttct gaaccagcag gtgaaagatg agagaagaat 190560 
aagagacctt tctatagatc ttcaggcaga caggcttgtt ggcttcctaa tatttcggtg 190620 
ggatactggg atgggttgga ggcaacatac tgtttaaata agtagacagt tttttaaaaa 190680 
taatgaattg tgtctctaag atatattttt accagaatga tactatatag gaatatgcgt 190740 
aaacccctgg actttgatct caagcaaaac cttttcccta ttatccctcc ctcttaagtg 190800 
catgagtttg gtaaagtact acagaaattt gttctctcac agttctggaa actagaagtc 190860 
tgcagtcaat gtgttggtgg ggttggttcc ctctggagaa tctgttctgt gctcctgcct 190920 
taactttagg gcattgccgg caacccttga cttgtagatg catcacttca atctctgcct 190980 
gctccacaag gtgttgtcca ctgtgtctgt gtccagtgtc tttgtccagt gtctgctttt 19104 0 
tcttctaata aggacattag ctatatgcac tcaccctaat taagtatgat ctcattataa 191100 
cttgattgta tctgcaaaca ccccatttcc aaataaagtc acttttccta gggttaggat 191160 
ttgttttttt tttttttttg gtggagggag aaaggaagga agtgggcaca gttcaattca 191220 
taaggaacca actaaatata aatgccactg tggtaagatc agcttgacca tgtagatttt 191280 
taaatggtgg ttttctttgt ttctattgcg ttttttgctc atcttatcag aaaatttgat 19134 0 
aactttagat aataaatatg aaggagtaaa tcattaaaaa tttaaagtga aggccaggcg 191400 
tggcggctca cacctgtatt cacagcactt tgggaggccg aggcgggcgg atcacctgag 1914 60 
atcaggagtt tgagaccagc ctggccaaca cgatgaaact ctatctctac taaaaataca 191520 
aaaattagcc gggtgtggtg acaggcacct gtaataccag ctactcggga ggctgaggca 191580 
ggagaatcac ttgaacctgg gagatggagg ttgcagtgag ccgagatcat gccactgtac 191640 
ttcagcctgg acaacaaagc aagactctgt gtcaaaaaaa taaaaataaa aataaaaata 191700 
aaaaaataaa gtgaatccca cagatttgct tatattcttt ataactagat taagtgtcta 191760 
caacagggct ctttttgact aactttatgg ggatatacaa caggtgccaa ccactccaca 191820 
ttccaaaaga aggcaaagac ttgctttgca ggacttgcaa agaggtgtcc aaaatgtttt 191880 
cctggcctta ggctcagcat ccccccgaag gttatgaagg aagagctgtg gtctcccctt 19194 0 
ggagagattg gatggatgct gagtgctcat cttcttcccc ttccctgaca gacaacttga 192000 
tgtgcatccc tggacttaca gggacaaaga agtgtccgat ttattccaga aaatatttaa 192060 
aggatgagag gcttgcaaag aggcccagag caaaatcaaa gaaggctata gcctgaaaat 192120 
aattttattc ccccgatagc aggccagatt gcatctttct tatcactctc tggtggacat 192180 
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tgtgagacag agcttagcca gggattatct 
gaggcgatct ctgcagaaag cagctggaag 
cagcctcagg agaacagctg aggttgaggg 
tggataacca ttcgtgggca gggatctcaa 
gcacccaacc tcctattact ttcaaaggaa 
ataactttac tagtgccgtg, ctctcccttt 
tgagccatag tgtttcaagg gtgtgtgatg 
gaagaaagtt tactacaagc ggcctgctga 
ttggatcaaa gttggcatct tgattcttat 
gacaattctt gagaggaagg gtgattagag 
ctttatgaga aatgctctat atttgattaa 
tggttttaag taggcagtct gtgggggcgg 
tgctcatttt ccaggttacg taaatattaa 
ttcataaata gtccaatatt aaaagttgat 
tactgaatgc cctctagcct tactacaaaa 
aaaaataagc cagctccagc cactttaagt 
ggacagggca ggaaatgaag aaattaaagg 
ctattagacc aggtcatttc cttggctggc 
ctttggctga aatcttttat ttatcctcca 
ctgtttctac atttaccatg ctggatcagt 
tgatattcgt actaaatatg taaatgttta 
gagttgtagt tattgtaagt tgcatcgcac 
gagaagtatg ttccattgaa atttgattgc 
ttgtgaaata aaccattcaa tctggactgc 
aagtcatctt tcacttatgt tttaatatgc 
acctaaagct caatgattgg acacttatta 
atagctcaca agaaaatttc accaaagtca 
ggtagggttg atagacatta gcattctgac 
atttaaggga tttgccccca aattaatcag 
tctaaagaat attcaaaagg tcccgggcac 
tggatcatct gaggtcagga gttcaagacc 
tactaaaaat acaaaaaaaa aaaaaaaaaa 
tccagctact agggaggctg aggcaggaga 
gcgagccaag attgcaccac tgcactccag 
aacaaaaaca aacaaaaaga atattcaaaa 
catcctcttg aggaacacag gtgactccag 
ctgctcccct ccttttggtt ctagggactt 
ctagtttaga gtttttctat tacaccacct 
ttttttttta atctcctgga tcacaatata 
agggctccca agaaatgtgt aataaaagtg 
agtagattaa taactaagaa aataaggcac 
ttcaagagtg ggaactgatt agagtggtag 
attagacata taaaaagata tctagattct 
agcataaata tgaagttcaa gtataaactt 
gcataaaaaa ttagcatcat gattattctg 
gttgggtcag agccattctt tattcagtta 
agagacttct tctttatgat aaaaaagata 
tactaattaa acagctaatt ggaaattgga 
ttattttctt ttctgtttgt ggaagttttg 
caaagagttt caggcatggg ttataggact 
ccataacatt ctatgagatt tcgtgagcaa 
ttaaacccta ttttaaagta cttacttttt 
ggattagtaa tatcattacc tatggaaaca 
tctccttaat ttccaagaac attaatcttg 
tctttattat cttattttct atgaagaaaa 
taaaaacatt aaaagccagt ggataactgc 
aataaaataa tttattctat tgataattcc 
taaggaaatg gacaaaggaa gaaaaaacag 
gaaatttttg tcacgctaca attattcttt 
agccacaacc attttatctt ctgacttact 
attccaggtt atggttataa ttactaactg 



tagatcttgt caaagaaggc cctggccaat 192240 
tgcatcatcc cagatcccag gttgagagac 192300 
attcattcta cctgggaaaa ctgcatgtga 192360 
agaaccccca agtatgccct gaagaattca 192420 
tcaaacaacg gtttccaagg gggcatgcca 1924 80 
ctctgtaaca cctctcactt cagtcctccg 192540 
aagcgatgga agcctcaggt ctggaaatag 192600 
ttaaagtggg cccaaacttg tgaagaaaac 192660 
ggagtacaca gaattgcaac caataaaaga 192720 
gacattttgt tacctgggag tgaacagata 192780 
gtaacaggga tttgcctctc aaagtggggg 192840 
ggtaagaaaa tggatttctg attttatctt 192900 
gaacattgtt aatttacttt aataaagaag 192960 
taaataatac ttttggaaat acatgaattt 193020 
agaatgcttt gcaaataata aaagctaaat 193080 
gacggaacac actctgtaac ttggggatgg 193140 
caagggcgcc tcactgaggt atatcaaatc 193200 
aactggacag ggctcctatt gtcttgtcta 1932 60 
catttgtttc atttgcaggt aattctattc 193320 
tttttccctg attttatttt taagttacta 193380 
ttattgggtt ttagctgttt ctataataaa 193440 
taagctttaa aaatgtattt ctccactgta 193500 
acacaaagct tgacatgagg tttattctct 1935 60 
atccacattc atctgaaact tgaaggaaat 193620 
catatttcta ttaaattaat ggaaggttag 193680 
ttaatgattc tttgcacata gagtgcatgg 1937 40 
tcttatatat ttctcataat aatcctaaga 193800 
atcagaattg gagacagatt cttctgtctc 1938 60 
tccatgataa agccagctct ataatattac 193920 
agtggctcac gcctgggagg ccgaggcagg 193980 
agcctagcca aaatggtgaa acctcgtctc 194040 
tagctaggag ttgtggtggg tgcctgtaat 194100 
atcgcttgaa cccgggaggc ggaggttgca 194160 
cctgggcgac agagcgagac tctgtatcaa 194220 
aggtgcatgg ggcaaggaga ggtcatttca 194280 
accacaattc tcctagatgg ccgggggccc 194340 
gattcctgaa gaggggtcat gcgccaacat 194400 
ggtctcttat attaatttgt ttccctgatn 194460 
atcgtttgaa tttttgttgt tgtgtataca 194520 
gttggtggga aaatcttgct aacagaatcc 194 580 
acacaacaaa aggttgtgca tggctgcaag 194640 
aataaataca tgtgcatgtg ttctgaatcg 194700 
ttgaaaaatt aataagaaac tttttttcta 1947 60 
accaaacaac agagaagtga taaagtgcta 194820 
tattgtttac atagatctat caagtcaaat 194880 
tcaaaatggt agagccattt ggatgagttc 194940 
attgtaaatc aaaaaagagt ttaaaggcta 195000 
agatttgttt ctccaaactt ctactatttt 195060 
gaaagcaaca ttttattctg tgattagcat 195120 
ggaccatgca tacttgaagt gtacttcaaa 195180 
ctacctctta ttgataaaaa gaattgctgt 195240 
gctttctcaa atataaggat gatgtactat 195300 
aaagatataa aaatttactg ttcactcact 195360 
aatttaagct taaacttact ttaaacatta 195420 
taatccaacc tatctactag tatgccactt 1954 80 
aaaagaaggg cttttaattt tatataaaat 195540 
aaactctcaa ataatatctt gctgcagtgc 195600 
cttttcatgc ctttcctgaa acaattgcag 195660 
gatcatgaaa aattctttta aagacacgat 195720 
aagcagtctg agggtgaata gagagtttcg 195780 
caaaaatcct attgcctttc attaagtagt 195840 
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taaaatattt 
ccctacacat 
acatttttaa 
tttttctgca 
aactttgcac 
tctttatttt 
cgccagctta 
aaaatacctg 
tacttatctc 
ttttctgtgc 
atatattagg 
ttctattatt 
gaattaacca 
ctgaaattat 
attctttaaa 
ttatccatca 
tttatgaaca 
agattatgct 
agtaacttta 
gagacgtaaa 
gggaaataat 
acta tat cag 
ttratgatct 
gagacagaat 
ttttgaaatg 
cggatggcac 
aaagtggatt 
ttgaaaagat 
gaaaaccaag 
agcttct agg 
ttgtcactat 
catccggggc 
aattactgtg 
gatggaagta 
agaagatgat 
tctcttccat 
agtggtttct 
cagctgt gtt 
ttccagttaa 
tgattattgt 
atttttacac 
aagagcttat 
catttctccc 
taaaataagc 
aaataataac 
tttaaaaaaa 
tccccaagat 
agagataaga 
gcaacattaa 
ggaaagatgt 
ctacttctca 
accatctaca 
aagaactcaa 
gaatgcaaga 
attctgttga 
acttagactg 
tatttatttt 
agaaaccaat 
ttgttaattt 
gggaagatat 
aatttatctc 



aattgagact 
acaggaacac 
tttaagtttt 
aatgaggaca 
tgttaatata 
ctaacagctt. 
gcgggtcagc 
gatgcccttt 
tctgaccact 
aaatctcttt 
aacctgaaaa 
ggaaacggaa 
cttgctactt 
tgaaacaaaa 
cattctcttc 
cttgtctaac 
gcattcttgc 
ttgctctctc 
gctcttcaat 
ttcaaatctt 
aaaaagctgt 
gtataactcc 
attactatgt 
tgccacctat 
taatactttg 
cctgcccccc 
cttagttata 
gttgggagtt 
agcacgaacc 
gaacaacgcc 
ttcctggcat 
tctctttatg 
ttgtatcttg 
agaagtacca 
ttcttaggaa 
attcttggga 
atactatatt 
aactgagtct 
gtctactttc 
taatattatc 
acccactgtt 
tcatcttcca 
tcctttcacc 
cagttacaga 
gcttattcaa 
aattgagaga 
ggcatattgt 
agagtaaagt 
aagaaatgct 
agtaatataa 
aaagagtaag 
ttaaaattag 
tatgtttaac 
ccctcaaatg 
atcactcaat 
aaagatactg 
ggaactcgtt 
ttaaatatca 
tcagactcta 
tattactttc 
aagacacaga 



acagtgtgtc 

tgagggtact 

gactttaatc 
ttaagctaaa 
aatggataaa 
gtatgagaaa 
atcctctcct 
gctccagcct 
gggaaactgc 
gaaaattgcc 
taaaggaaag 
atatggaatt 
atgttaatgt 
ataaagtatt 
aactttatat 
tggaaggcaa 
tttgttgcaa 
tgccaggtac 
caacacatcc 
tattcaatac 
ataattagag 
gcaaaattgt 
gccaggtgtt 
gtggggcctt 
atctgaattt 
accctggtgt 
aacataaaga 
catttatctt 
aggcataatt 
aggcacgact 
tgccagctgc 
ggggcatcca 
ggttttctca 
catctgagag 
aagaagtgag 
atttgagcca 
tgatgatgtg 
ctagaggcag 
agagagccat 
gtggtaagac 
gtgttgttaa 
taataaaact 
tctgcaacag 
aggacacata 
atacacggtg 
gatttgcttt 
tgaagccata 
gtggtataaa 
ttatcttggg 
aaagatttca 
aatttaactg 
gaattttctg 
tctattttaa 
ctcaggaatt 
aatggttatt 
cttacaatta 
ttaatgcaga 
tgctgattta 
catagctatt 
actttacaaa 
gctggttcat 



tgcacacagc 
tttaacaata 
ttctccgtga 
acaaatgaac 
tctttaatat 
gaagaactgg 
cccacataag 
cgtgctccca 
taaaagagag 
acctccaggt 
tttgcacatt 
ctagaaaaag 
gacgtggttt 
ttcccctgga 
ttcaaaattg 
gcatttttat 
ttacaaacat 
caattgtcaa 
agtattaaga 
gcaagaagtg 
tgtcaggatt 
agatggttgg 
ctgtctgaaa 
gctggggtgc 
tactcctgct 
agtttgttaa 
actatcaccg 
cagggattct 
caaacttcac 
gcctggaagt 
agaggcagct 
cagtgggagg 
atttatacaa 
cggaagaggc 
tttgggaaat 
tcacttatct 
gatctgataa 
atctcctgag 
tattgaaaga 
cacttaatgt 
ctataggtgc 
ctgtaccatt 
aatgataaac 
ttgcatgatt 
ggatttttac 
agttgatcaa 
gtgtgtgcac 
aaagagaaaa 
ggagatcctt 
atacaaggaa 
ataaggttaa 
agaatttgta 
acaggaatgg 
atcaagtaaa 
atttaggata 
taggtgggaa 
ataatcctag 
aatagtataa 
tttgtctctt 
tgggataact 
ggtagagcta 



gttgtaccat 
aaacagatca 
ttaaatacct 
cataaagtga 
gtattaacca 
tgatgcagaa 
cagagatcgg 
caactttgct 
aaacttgaaa 
tagcttttaa 
gaaacatcac 
aaatgtcatt 
aaaaaagctg 
atctaagtaa 
cttcacagta 
tgttttagtt 
ttatttccat 
tagaaaatag 
aaaaaaatac 
taattgtggg 
tacaccctcc 
caactaatcc 
cagcatggga 
agaaattttg 
taaggacacc 
gaaatgactt 
gctagtaaga 
gcagatgggc 
aacagtgagg 
tgctggttac 
aactaacaga 
tagtgggttt 
caaatgtata 
actgtaatca 
atattagatg 
ctgatgtgcc 
cacaatattc 
aattgtcagt 
gaaaagtttc 
gagatctatc 
aatgttgcac 
actgttgaac 
ctggagagca 
ccacttatat 
tgggaaaaac 
aattataagc 
aggtgttgaa 
agtgctagaa 
tggccaaact 
ataaagaaat 
ttattataaa 
ttctttttgt 
ctgcctatgt 
gtttatgtta 
tgcaaaagtt 
ataccataaa 
cgctctattt 
ataataaaaa 
ctattcttat 
gagaactcaa 
tcatcagaaa 



gcataattat 
tcaaaaaatt 
atctctgcat 
cgggaaaaaa 
ttgcactatg 
agtcaaatac 
agcatgagat 
tctgacagag 
aggattggac 
gagaagccca 
ttgctaatag 
atggggttag 
caggaagttc 
tgtaagtgaa 
atttcttatg 
actatttagc 
agggatggaa 
agccaactag 
attgatattt 
agattaggtt 
actggaagct 
ttgctcacat 
agaggaatca 
tttctcacat 
aagatgaaag 
tagttctatg 
aaggatttta 
agagagacta 
gtaccagaac 
gggtgcttga 
gccaggggtg 
tgatgcagga 
catacgttca 
cccatcccaa 
tattctcctg 
ttgcttgaga 
aaatattttc 
tgtgtttatt 
aacagaaaga 
ctgttagcac 
agcagacttc 
agcaacttgc 
ttatgctaaa 
gaagtatcta 
gctcgctctt 
taccggaaac 
taggcccttc 
tgcagtcatt 
atttcaactt 
tgcatccgtg 
attaactttt 
cctttcaata 
cttgcagttt 
agaaactagc 
aatcacaaaa 
atacaacctt 
agtaacaaga 
tgacaatttt 
gaaatattca 
taaattgtat 
tctcacgatt 



195900 
195960 
196020 
196080 
196140 
196200 
196260 
196320 
196380 
196440 
196500 
196560 
196620 
196680 
196740 
196800 
196860 
196920 
196980 
197040 
197100 
197160 
197220 
197280 
197340 
197400 
197460 
197520 
197580 
197640 
197700 
197760 
197820 
197880 
197940 
198000 
198060 
198120 
198180 
198240 
198300 
198360 
1984 20 
198480 
198540 
198600 
198660 
198720 
198780 
198840 
198900 
198960 
199020 
199080 
199140 
199200 
1992 60 
199320 
199380 
199440 
199500 
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attagttatc cctttccaat tctgaagtcc 
ctaggaacta ctattgcagt tggaactctg 
agaaaatctc ttcttaagag aggttttaat 
agcccctata tcctcagagc tcccagcatc 
aaagttcctt ccctcctcct gccctgtgtt 
taactacaca ctaagtgtga aagtgattgg 
ggtcaagggt tgaaaagaat taggggatta 
tcaatgtatg tcaatgaact ttattgaaca 
tgttgtacac ttacacatcg aaaccagggt 
aaggtgataa tgctggcaca gcacacatgg 
cgangcgggt ccgtatttct acacacctgc 
agccctggat gcttagctgt gtgatctgtg 
ggttagaaat acacctctca ggcccaccca 
aaattcccaa gtgattctca tgcaccttaa 
cttaaaatat tgctatggga tacaggcttt 
tcaagtttga gaaacaggag atttaataaa 
tctgttttga ttggtgtagg gtgtaatctg 
gtttcatgaa tggcaatgtt tgggaacacc 
cttcatcata ttgcagatgt gggaactgag 
tagagccaga cttaagctta gacccttaac 
ttattatttt acaggagagt tctatattgc 
tattttaata tatgcccrtg atggataata 
gcaaagtatt gatgctgttg tgtctttgaa 
gctggaatgg cagacacacc ctttatctgg 
ggccgggata taaagaaggc agaaaatcat 
aaggactctg tacttctata ggtgagtcct 
cagacaaaat ttaaaaccta atttaatagt 
agtcccagtg ggattcaacc actcatagac 
caaacgccga gctggaacca atctcactgt 
cactttacct tttttgtcta taaatttatt 
gagtctgctg tggttctggg gactgccgga 
cctttaaatt aattcagctg aagtttttct 
ccttccagcc tgcacctttc tcccatgctg 
aagttcttca gttttgggac tcagactggc 
•ttgtggggtc ttgtgttcgt gtgagttaat 
ctattagttc tgtccctcta gagaaccctg 
gaaaatctat taataaactg atatagtcag 
caatgttgta gagaattatt ttattgctct 
tacgatgctt ttaagttgat cacattcaaa 
tcaatttcat attgactctt gcctttttca 
aatcctgaag gttatacttt atttttccat 
atattttaaa cacattttta aaaatgaatc 
attagagagg ttaattcgat gacgaactct 
tatttcaaac gcatctctct tccaaatctt 
agccagcttg acagcccgtg aatggctact 
tggtcctatc tcttaaagac agagcaacag 
aatctcatcc cagtggacaa ctgggaaaac 
gcatcattta ataactaata catactgaat 
tataaggagc ctggcatgtg tttcctgatc 
. gtgtgtaaac ttggtgtcga gaattaaaat 
gttacctatg ttcatcgcct ccaactcatt 
ttgaaatcta ggtctgtctg cttatagggc 
gctcaattga tcctttcaag gctccatcac 
tgttcactga tgggctatgt tccagaagtt 
gagagagaga agacccataa atattctgaa 
cagaaaataa attgtatttc caaatatata 
attctctgaa gggccacttt cttttctttc 
tattaattca ctggaataag aatcacacta 
gtcatttaat gtgcagccat ttacctggtc 
ttgtgctatg attgatggca atgcagtcta 
gatttcctag gttctccctc agagtttcta 



tctgggaagt gtcagaattt acactcctca 199560 
aggctttatt tgggagtttt tcctgattac 199620 
aaacaaaata agtcggggat attaccagat 199680 
acacctttta taagaacagt atagaaaaac 199*740 
atttttccaa aaaacaaaca aacaaaaacc 199800 
agtgggggaa ggcaagctgg ggctgtacca 199860 
ttacagttct agttcgaaga caaacacttt 199920 
gccgccatat actagtaacc ctcttaggtg 199980 
ttcccagatt tcttatgatg aaacagatag 200040 
acagtgtgac ggaaggagac caattaagga 200100 
caccagctgg acggctgtgc tggaagtcta 200160 
aaccggcagc atcagcgttc ccagggtgct 200220 
gaccttctga atc'agaatct gctgtttcca 200280 
agctttagaa acactgttct aattagtgtt 20034 0 
agtaaattct caggatattc taccatgtac 200400 
tactgatgtt tggatcccat ccatagagat 200460 
gcattaacat ttttaaattg cccctggtga 200520 
tgctttaagt taagccatga gctaattgtt 200580 
acgttataag gaaatccaat ttgcgagtgg 200640 
tcacttattc tcaaaactgg tctgtgctgt 200700 
tatgtttaat acattagtat catttaaata 200760 
ctgagtgtcc acttgattgg attgaaggat 200820 
ggtgttgcca aaggagattt gagtcagtgg 200880 
gtgggcagca actaaatcag ctaccagctc 200 940 
gatggttaag caaactaaat atggcctgag 201000 
tgtggatgaa ctgtaaccta gcttaatagt 201060 
atgcactgta acaatggctg aatgttgccc 201120 
tgcttttgtt caaactgcgt tcaaataagg 201180 
ttctgtacct cgcttctaat tcctgtacgt 201240 
ctgaccacaa ggcacccctg gagcctctgt 201300 
ttcgcgaatc gttcatttct cagttaaact 201360 
tttatcaatg aaaagactag actggcttag 201420 
gatgcttcct gcccttgaat atcagactcc 201480 
ttccttgctc ctcagcttgc agatgcccta 201540 
aettaatacc ctctccttta tatatacatc 201600 
actaatacaa tgtctccatt gaaaagaaaa 201660 
agcccctcaa gaaaaatacc aaagaaactc 201720 
aaagacatca aaataattac atgccttgtt 201780 
gatgtgtcct atccattcat gtacatacat 201840 
gcagtgatca gtaaaaatta tctaaaaatc 201900 
tttgaacctt gcccttgaaa gatcatttat 201960 
tccagaagtc taatttctca tgctaaccaa 202020 
tagaaacaat catcattaga . acttgggtat 202080 
agcagttgaa acaatttcac aggttaaacc 202140 
cactaaaggg tccaagcaga ggcaaaatga 202200 
aatatttgga attaaataga actggactta 202260 
ttggaaacat taatcctgct gaatatttgt 202320 
taagtagtta gcaggattta caacactgtg 202380 
tttggtgttt ttagagttaa tattatgaca 2024 40 
ttgcttaaca gttctagtgt tctgtacact 202500 
gaggctcatg atgctaaggt gaattctgga 202560 
ctttttccaa cttcagggtt tcctgctttt 202 620 
rtttgagcta cagcaaatca tggtgtccrc 202 680 
gcaagaaaag taaaggggga taaaagaaga 202740 
gtatatgact aatctagtat aatggaagct 202800 
tttctgaatt ttttggtatt tgttattggt 202860 
taatgctcta tgtccttaca aaggaatgac 202920 
attggcacac agtcccaatc tgcttggcat 202980 
aagtgaatac ttgaaaatta aacagttttc 203040 
gggttaagtc ttaaacgtaa tcccttatag 203100 
actccataga tttgggggtg ggtcccagaa 203160 
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tttccatttc taacacgttc ccaggcgatg ctgaagcttc tggtccaggt accacacttt 203220 
gaaaactgct gttttaatgc ttacatgggg tagcatgagt tagttttgcc tggggctttt 203280 
aattccaagt tctgctactt aacagctgtg ttataatgat taagttatgt gatgtctctg 203340 
tagcttgatt tctttgtata atgaggaaaa taatcatwcc cttttacata agattattgt 203400 
gagaatttaa taaacaagaa atttcttata atgcctggta catagtaatc actcaataag 203460 
ttatgtcaat tgtcaaagca catgtctgaa ttatagatga gagagaaata ttgtgaattc 203520 
tatggcactt taatggcctg"" ttctctctac ccatctctag ccgtacctta cactggctta 203580 
tcctgtgctc ttcctttcct gagggaaccc aaccacatat actgcctaaa gcactctttt 203640 
ggcacttcta acataggtgc aaggaggaaa tttcaagaaa ttatgacaaa ctacaagact 203700 
caaatctcct tttaaatttt aaaaatcaaa acatgtattc atatcccatg gcacatatgg 203760 
atgaagatat tgtttttctc agttaatatt ttagttttgc ttttgtgcca gagccttcct 203820 
ttttttttgt aaatgtccct tatcaggcta ctgcagaatt gcacaaatac tgaattgccc 203880 
caaaatattt gcctttaatt tgtttcttta aagcaaatgt aaaactacaa tgggttatat 203940 
ttacctagag gttttatcac catacgctta atttaaactc atataaaaat agaattagtt 204000 
atatttagtc ttattttcat cattcactta atttaatata attcttagtc cataacttag 204060 
atataacaaa aatttgtatt ttttctatcc tccatttgcc ctaaatctcc ttaaacaaaa 204120 
caaaacccaa ggaaacawtt ttgcctggaa aacgtgaagt ttgattagtt ccacctacct 204180 
tiaatgcactt taatttgggc cccttctcct cggtagtcta gaatataaac tttttttttt 204 240 
ttttttttta agagatctgt ttggaaaacc taccaaaaca agacaaactc aatgatttcc 204 300 
ttacagagag aagcctgggg aaagaattag caggaatttc tcaacgggtg ccgctgctac 204360 
tagtttttta tgcatgagtg cttgtcattt ggcagaaatc aattgttctc tttacaccac 204420 
aagagtggtg gaaatggggg taaaactgca agttaagact gctctgaatg tggattcaac 204480 
acttcactgt tgtcctccaa catgcgccct caaagtacca cttggctggg aattaaaaag 204540 
ggttgccttt agcaaagctt taatttaaaa tagtcacagg tgacagctct attatttttg 204 600 
cacgytcaga tgcagccaca gctgttttta ttgatctcag ctgcatacac acccatttgt 204 660 
tgtaattatt agtttctgat gtagctgaag atacatatct cgaatgccta tcaaggctaa 204720 
attaaattaa ggaaatcgta cttttcacct tcaccctctt tggtcctgct ttgctgcagg 2047 80 
aaattgccac aatgtaaaac tttgagcctc , atttcatctc cccccactag gagtgatttc 204840 
caaagttgtc ctaggttcct gatctattcc aatttaatag ttactgacat tttttccctg 204 900 
tatatttaac acactacaag. caattaccat gtgrcaatta tgaatttaaa aattaaaaaa 204 960 
aacacaatta ttgctactac tgattaaaca ttgttaggac tttgaagaat aaaaaagtga 205020 
attcaacctg ccctcggctt tctaaaaatt gcctttttgc tcttaaaaat gcatgtacag 205080 
cttttttcat ataaaaataa tttcgagttt cttctatacg cataaaatat tttcagctat 205140 
ttttgatttc aggaagtatg aaaaattaag tatacacaaa agaattgcgg catgacccgt 205200 
tcgaamtctg tctcagtgac agacagtttt ggattgtaga cctcactttg ttttgcgtat 2052 60 
ttctgtttgt tgtttcacct gtggtccatt cactgtattt gatgtgttgc aaaacaatgg 205320 
cataacggyg ttcccaccag taatgtaaga agaaacatgg aggaagcaag taagggtcta 205380 
ggaaatttgg gagagaggaa aagagacata attaatgagg agagggatca agaagattgg 2054 40 
caggattttt agaagggaat ttgatttgca aggtacccag aacatggcac acaaaattgt 205500 
aagaaaaaca aatttttctt gaacatttat ataaagattc ctttgggaaa atagaacaaa 2055 60 
tttggttgta gagtggtctg gggttgattt taaattggaa aaacacagtt ttgttttttt 205620 
ttaatttgaa ataagagctg atatgcaata ttgtagggga gttggttgac tttttagtca 205680 
gaatagtggc ttttacaacc atggtgacaa cattaactgt ggatcctctt gacgtcttca 2057 40 
taagttcatt aatgctgggg gaaaatatgc atagcaacta atctaccaat atttgagaga 205800 
attaggcaac ctctcaagaa tgtaggaaag aaatgtaagt caacaagctt atacaatgta 2058 60 
aaaaccatga ataggcaggt aataaataat agaaattgat tttttctgtc cagtaaagac 205920 
catgaaaaga gaaatattat aattttaact tgagtgctct agaatactgg gtttgttttt 205980 
ttaattgatt aatatgattt taaaaaatta tcatgaattt aaaaatattt tctccattac 206040 
tttgagttat cactttaaac atargtaata tcaagatatg tttctgaatt atatatttaa 206100 
ttatgaccac taaagagcac ccatgaaatg gaagtgttaa cagcaatgtt ctgttgtagg 206160 
aagataatca tattggttat gtctgcttta tagtaatgtt attgttgatt gatttgcctt 206220 
ttcctaactt aagtcttttt ggtcttttct tctatttgtt ctaagagacc ctgttcaaag 206280 
ggacatttta ttttcctttg tacaacagta tttgcctgac atttactctc ctctaataaa 206340 
taatattaaa ctctagtcct aatacattta cttacctaag gaaatcggcc ttgtgatgga 2064 00 
cccccggagg tgagatacat ttctggatga ggatctgttg agaagttgag catttgttca 2064 60 
gctactaagt cactgccacc acctcgttag ctaaacatac tctgtgctca ttccttcatc 206520 
agtaaatttt ggttgggaca catgrcaggt gataatagag gttattagct gcaatgctta 206580 
ttaaaatatt tgaagatata ctttagcaca ttttaataat tattgagact gtcccttagt 206640 
aacagtagct cttctaaatg aaagctttta aaattgcatg taagtcaaaa caaagcatta 2067 00 
tttgttattc aactttatat aaaagctgtc cagtgatttc aaatatgtac taacatttaa 2067 60 
tttcaattca aaataagtat ttttatgttt ttataaagtg attaaatcac tttattgaat 206820 
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tctatccctt tataaatgaa taaaggaatg 
tacttttccc ttcccccaat taaccccaag 
gatacgtcaa ttttgttatt tattttgcct 
atatatatca gcacaaatta gcaggcccac 
nnttctaact cgtgaaataa actaccatgc 
gggttgatca taatgtcttc. attttagtat 
ggtagttttc aatcggattt atggggcatt 
catacctttc ttctgccaaa aagaagtgtt 
ggtaaacatc acrgtctgaa agggaactat 
gatacaaata agattttttt aaagtagaca 
actttatgga aatgaattca caagataata 
ccccagatag atttcttcca catctgtatg 
tactgctgga gaatacagaa aatattcatg 
caatttgtat aataagtgta acaaaatgca 
actccattat agacattttt agagcaaacg 
aaatattcaa gacatttaag tatgaattac 
tatcttcaat gctttctagg gctaacaaat 
aatattatgt actaggatat gtggatatga 
acagccaaat aaacaaataa ttagtgtaat 
ccagagaaaa attgaggttt gctgaccaga 
aaaaaatgta tatatatata tatccactac 
ctaattctct tattcataat tgagtttcat 
tgagaaagca tgtttaatat gatgataaga 
aagtgtgccc acacactgta ggcactgaat 
ttatctataa caggaatatt ttccrtcata 
cgcagagaga tgtgctatca catcccatct 
atgaagaaag a'ctttttgaa gtgcagcatc 
aaaeaaatat tattctcatg gttttgaata 
tatcacaatg gacaatgcag ggacccttca 
aactgatatt ctctattagg aaggttagct 
aaaaaggagg atcatttgag accaacttta 
atacagaaaa catcttcaca agtccacttg 
ctaattactt ttgcaccaac ctaattgtaa 
cggagagagt acacaactat aaaagcaagt 
tatttgttga acatggccat tttgacaatc 
cacaatctgg agaagactgc ' tcttgcacac 
gcaaattaca ggtacattgc tgcacaggtg 
ctctcctctg gagaggaatg cagcacattc 
tcatctagaa atacacctta gtwatcccct 
cttctgtgtt ttagaatttc tgttttatgc 
tcacgaggtc aggcacagct aactgttgat 
ctccatggag cagccccttc caagaaagtt 
gactcagctg tgtacctatt tctctcgtca 
agcagaacat ctggcctatt atataaatga 
acagtagaac aaaatggatt ttttttccta 
cattgtcatt ttttctacct tactcagaac 
actttgacct gtgccatcct tgtgaaataa 
ctaagtgaac atgcagagtc cttcagagac 
ttctacccgg gctagcttcc tccagagccc 
ggagatttta agaacaagtc ttgtgacact 
tttcttattt ccctgcaagt gtgtcagatt 
ttaatgatac tgaatgacat caggactcac 
ggaaaaaata taactgtata agcaatgtac 
aaaattaatt ttgataatca tatatgtaca 
tagacattga cttttaaaaa tcttaatcgt 
tgtctatatg cataaacaca aatctttcta 
atgtctttaa tttatttcaa ttttgtgtat 
atcttttagt atatatttca ttgtgtttta 
actgtcaaac tctcccgtca tgactatctc 
sagcgacaaa ccctcgggta atgcaaattg 
cataaccagg gacttgcaaa catgttttta 



aactcaagtt 'gtttctattc tattagcctt 206S80 
caaaaatgta ttgactttga cctgagaaga 206940 
ctactcctat ttcccataca tatatgggac 207000 
caatttctta ctcttggatc cagtctgatc 207060 
agcacggatg atcattcaaa ttacagctta 207120 
cccctttagg ttgacattat ttattaaaaa 207180 
atcgatagta agtctacttg atacattgca 207240 
tacaaaaatg tatttgacca aatactttac 207300 
cttctttgaa aggattaaag cctccagata 207360 
catgttaaat catggaacag caaaggggat 207420 
atttgaattt gacaaggtga taattaaagt 2074 80 
cctttcagta aagttggtta gcatattaac 207540 
ggatgtaaaa ctgcaaatta ttattcattt 207600 
gtgcatacag tgttacacaa gaggtaccaa 207660 
tgggatcctt atatgacaat ttttgtcttc 207720 
cactttttgc aatttaaaag aaattttgtt 207780 
tcttacttca aatgactaga atcattttta 207840 
acatatatat ttatacacaa atatatgtac 207900 
ctatatgtat taatattgnt atattttcct 207960 
gaagaagaaa gcatcaatgt gtgattaaaa 208020 
atttctaatt tgaaatagtg atcctgcata 208080 
agtacattca ttttaattta gacatttata 208140 
aactgtatga aatttt'ggga aagaactgtt 208200 
gtcacacttg agaatgtccc tggtcctagc 208260 
tggaatttaa agataagaaa tgaatcacca 208320 
atggaaaaaa caaatgaaaa aaaaaaaaac 208380 
acctgcattt atgatatttt cagacatata 2084 40 
aaattgtcac ttttaaaatt tgacatggga 208500 
gcccatagtc aacaggtgat caattgtgaa 208560 
agagtaggag ataaaaaaca acatgattac 208620 
atgtgaaatt gtgtagaaat gggggaaaag 2 08680 
agaaactaag ttggtgcaaa agttattgca 2 087 40 
gttatgcttg aagaaactta aaaggagaga 208800 
ggaacaagaa gtgaaaatga ecaaagaaca 2088 60 
aagggagaat agtaatttaa agtacaatgc 208920 
aagtgaaagy tctcatcggt cctgtctgat 208980 
tcagatcgca catagctggg ctggcactgc 209040 
tccacaggcg actggaaagt ctggattctg 209100 
tggcaaatta atgggaaaac aacctcttca 209160 
cttttcattg tttcatgcaa ttttattttt 209220 
tttcaccaaa ttaccttgtt attggcttgg 209280 
tcccaagaaa ctatgtactt cttctacttg 209340 
gccattagag tgttccacta atatagtcaa 209400 
atattagttt tgaagaagat atctttggag 2094 60 
cacaggaact cttggttttt cctcactgcc 209520 
atgattttcc ctcaactgca cctttctgta 209580 
gtgatatgtg atttatttta rtttctcctt 209640 
catctgtgcc tcctgctgag tctctgtgtt 209700 
agtctcaggt agatctgaac cacacatgga 209760 
tcctatgctt tgggtccaaa aactattgat 209820 
ctggcaaaac tagcttggag tttactgagt 209880 
tcttctcttt tcattttttt gctatgagtg 209940 
tttttttctt aacgcaattt aatctttact 210000 
aagtgtttaa ttttgttact ttttaattct 210060 
tgctattttc acccctgact ttatgcccag 210120 
ctcctatctg tttagcaaaa tgtaaataaa 210180 
tttttttaaa tttacttgcc tgggttgagc 210240 
ctgaaaattt gacttaagac tcagtgtatt 210300 
aggtagctga ggtatcactg acagcataag 210360 
gcatatccga caaccccaag ctaacaagat 210420 
ggacataaac tcrtcttaag gttttactta 210480 
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aagtascatc 

aatgcagtct 

ctctccattg 

acaaaaatct 

gttctcacct 

gccaaacact 

ccaatatttg 

aaatactacc 

tggkttagag 

ggaaggaagg 

ttctttttcc 

atttgttaaa 

atttcttcta 

ggtaagaaat 

ctgcaggcaa 

agtttttgag 

aacaggatga 

a-tgtcgactg 

tcaacagaaa 

gattggcctc 

cccaaatgag 

agcccatccc 

atggctgttg 

aaaacttcat 

attaaaggca 

gctggatgca 

aaagtcagaa 

atcagttggc 

ttttataggg 

gccataatac 

cacccctgaa 

aggacaaatt 

tgcctcttaa 

gagtttttct 

ttcatgagat 

gagtttagta 

atcctttgat 

ccttaaaata 

atacattaag 

ycatttatgt 

gaaataatag 

tagttgtatg 

tcatttagat 

gctaaattta 

tgaaagtctt 

tgtataattt 

taatgctgat 

ttacatgtag 

tccactcatt 

gagagagaga 

agtacagtgg 

ctgcctcatc 

tttgtatttt 

aactcaagtg 

ggtgcccgga 

gaattataaa 

ttagtataaa 

ataaaattag 

tttaaactta 

tattttgttc 

gttgccacat 



catctcatct 

ttcttcattc 

gagtaactgg 

ttccagttac 

accattccta 

tttgcagctc. 

gatatattga 

tttttccata 

gaaaaaaamc 

aaggaaggaa 

ccccaggaac 

tattgtaaaa 

ggaaaccatt 

ctgtcaaagg 

tggggaaatg 

cagttctacc 

ctgacacgtc 

cacattttcc 

tgttgcagac 

ctgtgcttct 

aagatgcatg 

aacagaccca 

ttgcaagcct 

aaatacaaag 

tgtagagtga 

gcctagcagt 

caaaaaaacc 

tcactgaagt 

attcttagaa 

acaaagggac 

ggctgaataa 

caaccttcta 

ctcataataa 

actgactagg 

aaattatttt 

gttacaccaa 

agtttttctt 

aggtgggagc 

gagaggtgga 

aatttatcta 

acaacaggga 

aataacattt 

tatattatat 

tacattacta 

acatgaagaa 

ctacttcagc 

catatggaaa 

taatattagt 

aaactatgca 

gagagagaga 

cacaaccttg 

ctccccagta 

tagtagagac 

aaccggcctc 

ctattcttgt 

gtgattcaca 

ttaattctgt 

ctttggaaat 

caaaagcaat 

cctaatagac 

taggctgaac 



ttaaccagcc 
tttcacccgt 
actgaggatg 
tactggaaaa 
gagtctatct 
.tcgtatttgc 
ggactggaag 
gaggcagctt 
aaaatgggtt 
ggaaggaaga 
atggcagttt 
tactttcctt 
tacccttctc 
cacttcacca 
accattactg 
tgatgaccag 
tcctctttgt 
ttggccagtg 
ctaacatgag 
gccatttgtc 
cagctgatct 
gagtgagccg 
ttgggtttcc 
ctttctgcat 
aagtctgatt 
tgtgctgtgc 
cacaaagttt 
tcaagcagca 
atacctaggg 
acaggctatt 
gtgtgagaca 
agcaCaatgc 
atagcatata 
ctgcgccaaa 
actatcttga 
aatccagcag 
tatcccttct 
cagcatttat 
gaaaaaaaga 
tgtatataaa 
attgtattct 
ttgagtatat 
ccatgtatat 
aaaataaaaa 
tattagaaac 
aaatcttgaa 
ctggatcaac 
ttacttcaat 
tatacatata 
gagagagaga 
gctcactgaa 
gctcagatta 
ggggtttcac 
ctcggcctcc 
actttgggtg 
ccctcttatt 
atatttttat 
aaaaagaaca 
taagaagcag 
tacacaccgt 
tcacatttat 



tctcctaatc 

ggctccatta 

gactcctttc 

tttctgcagt 

atctatttac 

attgtagggc 

taaagaaasa 

gaatacatga 

atttggttat 

aaggaaggaa 

gagaaaatcg 

attgttagta 

cccttctctg 

ggaatctggc 

aaactccttt 

cttactatgc 

catgtgacct 

gctctggtct 

cagaggctct 

ggagcaacat . 

agacttaagc 

aaatcttccc 

tggttatgta 

gagtttctat 

ttacctgaat 

catgggaagg 

gagtgtggaa 

actatgaaaa 

agagaaaccc 

ggtatttagg 

cctggtttat 

aaagtagcat 

gcaatcctgc 

atatttttca 

ttaaacaact 

aagaaaatga 

ccttacatct 

aaaatatatg 

aagatcatat 

tgacttacaa 

tggggtgaag 

tagcttatct 

aaagaataat 

gtgtcatata 

atagggatta 

aagcactctg 

ttgcaaactg 

tatctttcct 

tatatatata 

gacaggatct 

acatccgcct 

caggcatgtg 

catgttggcc 

caaagtgctg 

tctttttttt 

ctaaatgtgt 

ttaattttaa 

gatacggtta 

tatcacaaga 

caatagttaa 

tatttctgtt 



tttgcagaat 
aaygtttttc 
ttaagaaatc 
catcatacat 
ctctgaggtt 
cttggctgag 
ctgccacttc 
ttaggaattc 
atagagaagg 
ggaaggaaga 
ggaataaaac 
gaattccatc 
ggcactrcat 
tcttataaaa 
ggagasggct 
tcatctttac 
tgcagagcct 
tggccattga 
aagtaaacgt 
tattcatgcc 
tatagcctag 
agcagtttca 
taaagcagca 
tctgtgccct 
tttaggaggt 
gacagcatgt 
aagtgctaca 
'agttgtggga 
cataaaaggc 
cattaaaggg 
atttatataa 
atccaatcta 
cacaggcgct 
ggttttaact 
aagtaaagtt 
ctgtaatttc 
gtgactaaca 
catatagaga 
ataatcattt 
taaaaatgag 
tctacaatgg 
tcaaaatgtg 
gatattaata 
•tttgagcctg 
cgtatcttga 
atatgtcatc 
gatcaacttg 
aactctcacc 
tatatatata 
agctttgtca 
cctgggttca 
acaccacacc 
acgcttgtct 
agattacagg 
tttttttgag 
tcagtctaaa 
gagtggagag 
tagttccttt 
tttaaagttt 
agttcactaa 
attagcttaa 



tgtccattct 
tgcaggtagc 
cagaaaagca 
gtatcattag 
tcttagatta 
actagatctc 
acacctatgg 
actgtgactt 
aaggaaggaa 
gaaagatcct 
awgtcttata 
ttggcagact 
ttctctttct 
tgcartaaaa 
tagcttatga 
ttcctcctgt 
tcaggaattt 
cttgcatcag 
gtgtgatttg 
gccacaggtc 
aatcacatcc 
agagcaggga 
ttactcacta 
gcaagctttt 
cagggccaag 
gcccccaaaa 
ggcttctgga 
catattagca 
tagattttct 
aagtttgacc 
atttttgcac 
ttaagcacca 
aacatttatt 
catttactgm. 
ttgtttagaa 
ctatataaca 
cattatactt 
aaactaacac 
atttttatgt 
tgtattccta 
atagatggaa 
cttggtcttg 
tactcttact 
aggacttatt 
ggatgaaatc 
tattgcctaa 
caaactaata 
ttgaaagtac 
tatatataga 
cccaggctga 
agcaattctt 
aggctaattt 
tgaactatgg 
catgagccat 
tagtcataca 
tgyaaaagtt 
atattagcta 
ccccaaatgt 
ttaaggatat 
aaatccagaa 
cttatattta 



210540 
210600 
210660 
210720 
210780 
210840 
210900 
210960 
211020 
211080 
211140 
211200 
211260 
211320 
211380 
211440 
211500 
211560 
211620 
211680 
211740 
211800 
211860 
211920 
211980 
212040 
212100 
212160 
212220 
212280 
212340 
212400 
212460 
212520 
212580 
212640 
212700 
212760 
212820 
212880 
212940 
213000 
213060 
213120 
213180 
213240 
213300 
213360 
213420 
213480 
213540 
213600 
213660 
213720 
213780 
213840 
213900 
213960 
214020 
214080 
214140 



93 



WO 01/40493 



PCT/IB00/O1507 



ttataggagg gatgacttct ttttacaatc 
tgatttaatg acaactttgt gctttatcta 
atctatgttc caataaaact tccaacataa 
ctatatctct ggattatatg gatttacata 
taagtataat atacatgtct gagtataata 
tgcatatcca tattgcatgc- -agctatgtag 
acttaagcaa agttgaatag ctctcaagta 
tccaccattg ctaatgcaca caaaaaaaac 
ctctgataaa attattgttt attcagaaac 
gaaaacaaaa actcctaaga gaatgaatag 
ctcagaattt tactctgtta ctgcaaaatc 
aacgtctctt cttctaaagc tcatgaaact 
atctggagtt cagtactcca tttcattcct 
atttcatatt ttcctttcat gggggataat 
accttataag gaatcataga tcaacctaaa 
tacttgcttt gtctaagctg cttaacaaca 
aaacaaaacc aaaccaaatg aaaa^cctgat 
ttaattattt aaataacaaa tatgaggaag 
acataggaga gagtcttatg taaactaatg 
ggagggcttt gtgttaagca ctgtgaatac 
tacaaarctg agagagatta tgacagttga 
agaagtggaa cttgatcaca ggaagtcact 
taacgcattg attttgagcc cagcagcatg 
gcaagtaact tatttaactt gagtctcaga 
tatagcatca cattctcagt actaagtaag 
agaagacttc acttcatatc tacacgacat 
gaagaattaa ctgaaattca tcccaagatg 
tcatctgtgt tgtttgacag tcacgggtag 
ttaaaacttg aagcacattt aataaaaaga 
gtgcatagaa atggcatttt atcacttact 
cctaaaaaca aggaaacaaa acaataataa 
agtcagtttt gcttctgagt aggtattgta 
ttgctttacc ttactagaca ctgagttatt 
gaaattattt tctctttttg ttaataacct 
aatttataga aaaaataaaa gtaatgcttc 
tagaaattat aaaagtaggg aagggtccat 
atgtatcaga tacataaatc cataaatcca 
tttggataaa ttaatatact tgaatatttt 
cttcacagga cttttgaatt tccgttgttt 
gtatgtgtaa ctttcaataa taatttagtg 
tttgattacc tatggtatta gtttatatca 
tcataaaaat tatatttcat aaaaatattc 
cacgtgttat cagctgggtc ccagacacag 
cagactacat tgtcatcaga aggattggct 
tttggtagaa gttatttcct tgtggttgca 
ggtccacatt cgttctagag gctgcctgcg 
atagtttctt gccatgaaga cttttccaac 
agatccrgtc tcagttggcc aggtcttgtg 
catcaccttt cccatgttcc attggttaga 
aggggttata caaatgtgtg accaccaaga 
aactgccaca actacatatt tggatataat 
aattgtagga attgcctgag aattgaggac 
taaacaagtc tagcttactg agaaagtttt 
gggagaagcc atttgacttg tctttgaggg 
tctctgcaaa acaattcctg gaatatgatc 
acctgatgtg aacacagaat gctttgaaat 
caggtgagga aactgaaaca ctggggttaa 
attttggagc aggaattcaa attcaggcag 
atattaaact tgacatccat cttacatgca 
catttataaa agccagctca gatgttactt 
taatagagtt aatcacaccg tcctgtaagt 



atttaatatg acaaataata tattacattt 214200 
aaaggtagca ttatttttgt gtgtgaatta 214260 
gtaaggctct gtatgtatgt tatgttcttt 214320 
cttgtatgcc agtacatatg tacatatgtt 214380 
tatacattct tactgtacaa actatgcata 214440 
atagactctg acagcataga aaaattttaa 214 500 
ctgatatatg tatgtgtcca atattatgat 214560 
taccaattta tttttcttat ataattatat 214 620 
ctaatatcga ccattaatgt tccagtttca 214 680 
aaattttctt aaggaattat aaagaaccat 214740 
tcatgcaaag ataaccaagt acggtatgtg 214 800 
gctctctgat tcctctagcg ttgacaraga 214860 
ggacatttag cccattattc ttgtatccat 214 920 
tgtaagttaa aggtgaattt gttgactcaa 214 980 
gatttcctaa aaatatacct acatattgaa 21504 0 
acagaaaaaa acaagcaaac aaaaactcaa 215100 
ggcccaaact tgtaaagtgt attttcttgg 215160 
aaaggcattc tagtgaatct gctttaaatt 215220 
ataatgaata tgttcggagg gtttgtatgt 215280 
'tttgttccat ttaaacgcca ccttagactc 215340 
ccaagattac acaatgagaa agtaagagat 215400 
tctaaagact caacatttga ccctggtctc 215460 
tgaatttatt cctatcagga taaatgaaat 215520 
tacatccaaa taaaaataaa tgccctcatt 215580 
ctatctataa tgatacaatt ccattgaggc 215 64 0 
ttcattcttt ctttctattt gttttttcta 215700 
aggcatcttg gctgaggtga gatatgcaaa 215760 
ataaatcata tcactaacaa aagtcagttg 215820 
gagattctcc cagtcacaca ggccaggtgg 215880 
gaccagagag ttgcaaatac aatcccatca 215940 
gaaaactttt ttttttaatt tttaatagaa 216000 
aaataaactt atgggttctt ttaacttttt 216060 
attttttgaa ggaaagtgaa ttataaatca 216120 
taggacataa gaaatacata tttccagagg 216180 
cattattagt attctcccta aaactatagt 216240 
caatagtatt gattgaaaat gtaatgttac 216300 
taacaatggc aaagggagac atcattacta 216360 
cccacataaa ttcactaact attcaaaaaa 216420 
gttttactga ttatgacagc tcacaaagca 216480 
atgaaatatt accaaacaag aggagaggat 216540 
ttg.tgtaaca aatcttcata aaaatataat 216 600 
ataaaaatat aattttataa aacatattac 216660 
agtccgtcag agtccagtga agatgttggc 216720 
gggaagaatc tgcttccaaa ctcattcagc 216780 
tgactggtct ttttgctggt tgtagttgca 216840 
gtccttagag. gcctcatagg caaggaactc 216900 
atggctgctt acttcttcaa gcctgtaagg 216960 
aaaggtaacc tcatcaccgg agtcaagtgt 217020 
agtcccagct cctgcccaca ttcaagggta 217080 
agtcaagatt atgtgggtca cttagggtct 217140 
ttcaaagaaa gactgtgaaa aagcataaag 217200 
gttcatctgc aatgcctgtc aggcttacat 217 260 
aaaaaatggg tttgtatgac tttacttatt 217320 
gaaaaataat ttaaaaatac agatatactc 217 380 
taacagaaaa gcccagcaga cctgattcaa 217 440 
gagtaaattt aattattaaa cctaattttt 217 500 
gtatcttttg cagggtcaca cagctaaaga 217560 
ctggcttcag acctcatgct attaaacctg 217 620 
aatgttactc tctctatcag gaaaataacg 217 680 
ccaaccaaca aatttcctgg acacactaag 2177 40 
actacccaca gtatccgggc tctaccacat 217 800 
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tgtaatgcaa taggtgtgtt cacactgttt 
agcatgactg tgttgtgtgt cccattatcc 
gaagaaatta atcttgatgg ctttcaagtt 
aaaagtatct tctcagtttg agagattacc 
tggggactcc atattgagga cattttcttc 
gatcctaacc taatccttta jgaaggctaga 
cagtggagta agcagggttt tattgtgctt 
caaaagcgaa tttgttctgg aaaataatgg 
tggaatattt tgaaataatt gtaaagcaaa 
atatatacgt aagatgttat gtgcctgagg 
tacccaataa tgtaccaatt acaaatagca 
agaacgggat gaaaaagatt ataatcacgt 
gagttatctt ttcataaaat tagatactgt 
atgtgaggag aatgttttca tgttaatcaa 
cagaaggcag gcaggttggg ggttggagga 
attagttgcc agaaatatat ctattgactt 
tcccaaaata cagctgttta cagtgattga 
tccattagtt ctgccttttt cccctgttaa 
gatcagctaa tgttcctgcc agattcaaag 
aatctcaaag aaatggcttt ttaaatgttg 
atcagaaatg ttgaaataaa tatcaggcaa 
tctataaata ttttatgttt agcccaaaca 
ataaaacttt tattttaaaa tagtatgtag 
tagttttcac ttcaaataat gattatttac 
tgcaaaccgt tttttttttt tctttttttc 
cttaacacat ttaggtatat tttgtaggaa 
gatttctaaa atgtaaattc caaagacagg 
cttacagagc acagaactca gatgctctta 
tgctccatca tcagaatatt attatatgat 
cttagtacag aacaacagca. aaagcctggg 
tggggagcag aagccagtcc 'atttgactga 
aattgcatat atgtgagact ccgggtattt 
tctaccctct gctggctttc acgggttctc 
ttaccctaat tcctgaccac tgtaatccaa 
aggttaccca tggaattgtt tggaaatgta 
aaatagaagc tgtacgaaaa tagtcaatac 
atctcagcac tatgggatgc cgaagtgggc 
gctgggccaa catggtgaaa cctcatcttt 
gttgtggatg cctgtaatcc cagctactcg 
tgggaggcag aggttgcagt gagccgagat 
gagtgaaact ctgtctcaaa agaaaaaaaa 
agacctggaa actgagagcg ttcatgaatg 
ttattcatga tacaaatcaa gcaaataagc 
aggtcaacgg ggtgagagag ccacccagaa 
tggagaggtt tctctgaggc agtggcatct 
caggtgggtt tggggaaaaa agtattccaa 
agcaagggtg attagaatac tttgatgaag 
atgaagaaaa gggtggagag cagattattg 
gacagcatcc ttaaaataat aaggagccac 
ccaatttgct tttttacaag atcactctga 
aagcacgata ccaggtggca cataggagct 
ggatgggagt ggatttgttt cctaggctgc 
aaaataacag aaatctattc tgtcccagtt 
ctctaaagac tttaggggaa aatctgttca 
gcaatccatg gagctccttg cttgttggaa 
tatgtcttca ccttctcttc tccgtgtatg 
caaggacccc agtcataatg gaatagagcc 
tctgcaaata tcctatttct aaataaggtc 
aatatatctt tcttggggat atagtttaac 
gataagggtt agtggaaaga aaccatgtaa 
tcaacaggac ttcatgatta agtctgagtt 



gtctctcctg gtgaatcgtt aattcatcag 217860 
ttagtaaata atgtagtgcc ttgtgcatag 217920 
ccaccccaaa cagtcaagcc atgttgattg 217980 
atcagagtat ttgaaacaag attgtatcct 21304 0 
ataggattct aaggtctaat gacttggcct 218100 
gaataaaatt aaatatgttt ttcacttttc 218160 
cttgaatcat tgtcacacta gtaatctctt 218220 
gttagaggcc tctgtggagg aatacagtag 21828 0 
acagataatt gcgtttttaa gctctttgaa 21834 0 
ttgaacaaaa agataaatag ttacagtcat 2184 00 
ttgtgtgtac atgtcctcta ttataaaatc 2184 60 
taataatttg atttgcttca ttcatctttt 218520 
gttatttcct tctggaaagt acaatggtgc 218580 
tgctgacaag aatattacac ttaaaaaaca 218640 
cagacaaatg aagaagacca gattgaaatt 218700 
tttacatttt tagaagtacc tctgttagct 218760 
taagttcccg tgcttttaag cttattcaca 218820 
aaatctgaag ctgtcacatt gaagtagcat 2188 80 
catgtcccca ggggtggcaa aagtattcaa 218 94 0 
cctgttgaca ttggtcactc catttgctgt 219000 
ttatagaaac aggactttga tgatcaattt 219060 
aaagaaggca tttggaattc actattttgt 219120 
tatattttca aaattggtta attaattttc 219180 
tctattccct ctagcattag attattttct 219240 
tttttcctgt aatagtttta gagtttctgc 219300 
caaagggaaa aggtatcagg gagaggaaat 219360 
gaatgttaaa agtgtttgcc agtcctgttt 219420 
taaagataca ggataaatca catcatttcc 219480 
ttagatcact tttttaaaaa agaacatggm 219540 
gaaggagagg agtgcaccat gaggagtcaa 219600 
tttggttcgt gtgcaaaata attgctaaat 219660 
tcaaaaccag ctggcaaaat tgtgttattc 219720 
tgttctctct ccttttcctc cattctcctc 2197*80 
taatctaagg ttttaggatt tggatgacta 219840 
gacctgtaat ggagagggga gaaaatgaag 219900 
tgaggccagg taccgtggct catgcctgta 219960 
agatcactgg aggtcaggag tttgagacca 220020 
agtaaaaaat acaaaaatta tccaggcttg 220080 
ggaggctggg gcaggagaat cgcttgaacc 220140 
tgcgccattg cactccagcc tgggcaacaa 220200 
aaaaagaaag aaagaaaata gtcgatactg 220260 
ctgcatccaa agaatatttt gtactacgtc 220320 
acactctgtt gttcacatat gacactatat 220380 
acagacacac aattttgtta tcagatattc 2204 40 
gaagaatata cagatgttaa ccaaatgaag 220500 
ggagaggggc agggtgtagg acaac.taaag 220560 
tgaaaaaaac actccaatga gtgagcgcgt 220620 
aggtccattt agatcatgtt ttagatcctg 220680 
taaaagtttg aagaagggca atgacatgtt 220740 
cttctgcctg gagtagcata taaagggact 220800 
gaagtrgctc ttgtcaaaat aaggtgatgg 220860 
tacaacagag aactgtgaat ggagtagctt 220 920 
tgggatggag gtttcagcag gacatgctct 220980 
atgtccttct cttagcttcc agtgttgctg 221040 
tatcactcca tcatcacggt gttcttcctg 221100 
tctgcctctg tacctcttct cctcttccaa 221160 
actctaatga cctcatttta actgattaca 221220 
atattcatag gtattgcaga ttaagactgc 221280 
cccaaacagg ggattattga aaggaagtga 221340 
aggttttaaa atatattttt ggtggtagac 221400 
tagggatgga agcagagtcg tggttgactt 221460 
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ccaggtttct tccagatgat gatgccattt 
agtggtgggt tgagcatggg ttcatttttt 
tttttatgtg gagctgtaga tttggtggtt 
ttgagttgga gagataagtg tgggagtcat 
agtagataac tttcctcaac cagataatgt 
acattaaaca ttttaaagaa_ caatgtggac 
agactaaccc tgccataaga aaaaacaaac 
atggttttag agacaaaaaa catcaggcag 
atcccaacaa ttgcctgaga ttacctaagt 
atatttctat ttaatttgat attgaaattt 
aaaggcatcc aaattgtata ggaggaaata 
actatataaa taactatgaa acatacaaaa 
aggtttcaaa attcaaattt aatatataac 
caatatgtaa ttgaatttta aaaaattgta 
cagataaatc tgacaaaatt tgtgaaagtt 
aaaaattaat gattacctaa ataaatgcag 
atati:cttaa gatgtcagtt gtccccaaat 
gctacattgt ggaaattgac aagctgaatt 
ttacagctgt aacacttttc taaaagaaca 
ttattagcat taagatagac aaatcaatag 
ctatacatat atggacaact gattttggcc 
gcaatatcat attttcaaca aatgatgttg 
aatctttgat ccatatttaa aatttaactc 
tttaactata aaattcatgg aagaaaacat 
taaagatgta ttagatgcaa cagaacaaaa 
ctggacttca taaaaattaa acaatcgcaa 
gcttctgctc tttaaaaatc attattaaga 
agaaaagatt tacaaaggtt atatctaata 
cctcaatgtt caataagaga aaaaaaataa 
aaaaaaatgg gtagagaata- tgcatatgar 
aatgcaaatg aaagcataac aagatagcac 
aatgctgacc atatcaagtg ttggtgagaa 
ctgtaggaat gcaaaatggt taaattactt 
ataaagggtg atctatccaa ctactattag 
tggaagtaaa agcatatctt cattataaga 
cagcagctga aacctggaaa caactctaat 
aaactgtgat gcagtggaat acgactcaac 
taaaagaatc caaacaaaat agagcaaaca 
taaatgcaaa ctaatacaca atgaaaaaaa 
agagggaaag gggtagataa agaaaagagg 
gagaataata ggtttgtggt aaacattacc 
taaaaactta tccaattata cattttaaat 
tgtaaagttg attttaaaaa aagttcctat 
aaaaactggc caggagcagt ggctcatgcc 
aggcagatca cgaggttagg agtttgacac 
ctactaaaaa tacaaaaagt agccagacgt 
gaggctgagg caggagaatt gcttgtacct 
gcaccactgc actccagctt gggtgacaga 
aaaaaactta caccaaggtc aaagcagagg 
gagtgataga gaataagaaa tgagagctcc 
ccttcctcat gctacttaga agttaaaaaa 
tcagaagtaa cattatggac taatgaaaag 
gaaagtaggt tgactgagaa atgatgaaat 
tccctaccca tatctcatct tgaattgtag 
ccctgggagg taactgaatc atgggggcag 
ataagtctca .tgagatctaa tggttttata 
cttgactgct gccatataag tccctttgct 
ccttctcagc catgtggaaa tgtgagtcaa 
ctttattagc agtgtgagaa cagactaata 
cttctaaggg aagagaggag agtagagaga 
cttgttatta agatatcaga gcctagaaca 



gttgagacac aaaatccaac ggatcaagta 221520 
tgacatgcca agtttgatct gtctgtgaaa 221580 
aaaaatgcct ttggaattca aatgagagat 221640 
ttctattcaa attatcatca aagctatgca 221700 
ttagtcagaa gagaagaaac cttgacatca 2217 60 
tatttcagac taatatgaac aaaaaggacc 221820 
cgtcaagaaa cagacaggac atttgaaacg 221880 
ccaaggacag aaaaataagc aaatacattg 221940 
tcaggtaaaa gacaatattg tctgctttca 222000 
gcattcatag tagtaagaca gcaataaata 222060 
aactggtttt atctgtagac aatatcataa 222120 
aggtataaga gctaatgcat gattttagtg 2221&0 
attaaaatca cttgatacaa gagcaaagaa 222240 
cttataatag caaaaaatat aaaatatttc 222300 
tcacacagtt aaaaatagaa aacactaagg 222360 
aaagatacat ttttaatggg ttggaagata 2224 20 
tcatctatag cttccctgtt atcccagcaa 22248Q 
taaaattcct atgtaaacgc aaaagaattg 222540 
aaactgggaa taagtatatt tttaagacag 222600 
gataaaatag aaagtaaaaa aaaaatagac 222660 
aaggggcaaa ggaaacttgt tgggagaata 222720 
gaacatctgg gtattcatta caaaacggtt 222780 
aagttgaatt atagatgttg gtgtaaaatc 2228 40 
atgagactaa cagtgtcaat tagcgttaag 222900 
catgatccat gcaagaataa atgtgataaa 222960 
caacaaaaaa taacaaagaa atgaaaaaga 223020 
aaattaaaag acaattctcc atggtacaca 223080 
taagatttgt atccagaaag tataaagaaa 223140 
aaatgagcaa aagctttagc agaccactga 223200 
aagatgctca atatcattag tcattaggaa 223260 
tgccctcttt tttaaaagtc taaaactaaa 223320 
tatggagaaa ctagaacact catatcctgt 223380 
tgaaaaacaa taatgaagca gattccttaa 223440 
atctattcaa ttgatctagg tatttaccca 223500 
cttctacaca aattatcaca tctttactta 223560 
gcccgtcaac agaggaatgg atggataaag 223620 
gaagatgaga ctaaaaataa ttatactgag 223680 
ctgtgccatc ctgtttatac cttactccag 223740 
ttacttattt gagaactggg gagaggaagg 223800 
agagattaaa aggagcataa gaaaacctca 223860 
gtrggtaatgt ttttagggta tattcacatg 223920 
atgtacagtt tagtgtgtca gttatgcctc 223980 
ttccaagttc acaatttcat ttgaactctt 22 404 0 
tgtaatccca gcactttgag gagcagaagc 224100 
cagcctgaac aacatggtga aaccctgtct 224160 
ggtggtgggc acctgtaatt ccwgttccag 224220 
gggaggcgga ggttgcagtg agccgtgatc 224280 
gtgagactcc atctcaaaga aaaaaaaaaa 22434 0 
agctggcaaa agagactgag aaaaagagca 2244 00 
cacggaagct aagggcagga accacacagt 22 44 60 
aaaaatgaga gttaaaaaag gccattcgtt 224520 
gaatttagtt aaataaggca tgagacaaaa 224580 
agaggtagca agtaatatgg tttggctgtg 224640 
tttccacaat cccacgtatc atgggaggga 224700 
gtctttcctg tgctgttctt gtgatagtga 22 4760 
arggggaatt tttctacaca ggctctctct 224820 
cttccttcat cttctgccat aattgtaagg 224880 
ttaaacctct ttcctttgtc tcaggtatgt 224940 
cagcaagtat aaaaaacatt aaatgtttgt 225000 
gagagattgt tagctgaagg gtaaagtatt 225060 
tatttaaata ctgatgagaa ggatacaagg 225120 
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gagagagaaa atttgaatca atagtttgtt 
aggtttgata gaggaatgag ttttcctcat 
ctggatccta agggtgggat attaagggag 
gatcttggca gcaaggatgt ctgccaagat 
taatgaataa aaacatatat gaaatagcca 
ttagtttaga ccagagtccc _tggagttata 
ttcacctgtg gtactgtgaa caatgacaga 
gtgggatgat aatggcatgt tcaccataga 
gttcaactat tttcacaggt cctgacactg 
gctattataa agcaaaatga aaaaacacaa 
aatactaaga agaaaagtat tgttggaaat 
agaacttgct caaagaggag gctggtttag 
ctggattatg gaaataccca gagaactggt 
tgtttccaga agactttggc atctgcgtca 
cgtgggcagg tgccgtctca tcagctgatg 
atgtgtttct ctccttctta gggctggaac 
gaactccact ttttggactc taggtattac 
ttactcctta gatcttggtc tgagagttac 
tgccaggatc tccagcttgc agatggcttg 
agccaattct cctaacaaat gctatctatc 
tctatctatg tatctatcca tccatccatc 
tattggttct atctgtctgg agaaccctaa 
caaatgtcta ccattgacat tcctctcctt 
tgtaaaattc catgtaatac acaagccaga 
acagctcatt tcactaccac aatctaccat 
ctccaaaaga ctaaaatcat cagtttttac 
cagggaattt taatcttact atttcaattt 
tactaaaatg ataaatacta tgtgttatta 
ttctgaccta ttttattttc aggggtactt 
taaagaagac atcagagaaa atgtcttcca 
cttgactcca actgtattgc aatgaaaaat 
gtcctaggcc ccttcttcta aattccaaga 
aaaattcacg aaagtagtct ctactggttt 
tccagaaccc cagaatgtgc ccfctattt:gg 
agatgaggta atactagagt agagtgggct 
gaagaggagc agagccacag ggaatggtga 
atgtgcctgt cagccaagga aggtggcgca 
agcctggcac agattctctg tccagtggga 
ccctccagaa ctatgagaca ataaattgcc 
tagtgtatta tgtcagccct aaaaagctaa 
gtcaggaaat a.tagatcaca gacaataatg 
taatgtttgt tgctctttcg gtctctccta 
ttgaaaaaac caagaaacaa tacgaacaga 
acctcaaatg gaaagatggg ggctgtgttc 
taaagagata aagataaatg gtcctgtgca 
ctaagtatat gttagtgccc agtgaatagt 
atgaattgat gagataatca atctgtgaga 
tgggttagtt tggctaaaaa ttgaagatca 
tgctacttgc caggcatggg tattggtgtt 
actgtcaagg aacatagact cttggaattt 
tccgccatta aaaaaaaatt atttttaatt 
ataaagcaag ataaatcaat tggtttaaaa 
gcagagaagg ttccagtcta aatgtaggca 
aaaccctatg ccccaaaatg tgtaatcaag 
cctttctctt tggaatttta tataagcccc 
aaagtacaaa tgggtgagta tgcatctgct 
agcactcctc gataccaatt gactattgaa 
agtaaaccac acaattgcag aaatcctcta 
gaatctgaag aaaaggtttt ttagaaaagt 
acaatcattt ttcattgaag tgcagctgta 
ttaaaatttc ttataggctg ctttggctta 



tccactacca cagaaaagga tgagatccag 225180 
tggagagata agagcagatg ggtttgaatg 225240 
tctgtttttt atggctttta ttttctcttt 225300 
ttaagggttt ggacaggagg atcatagttt 225360 
cttcagagag tggggcagac agtatgtgtt 225420 
ggggtttgaa ttcaaatcct agctcctcca 225480 
accaagttaa gaccccaaat cttctttaaa 225540 
cttcttggga gtaataaatg aaatacttta 225600 
ggttttgaat aaattgtgcc caggactatt 225660 
tggcccaatt aatatgtttc caaggagttc 225720 
ggaaatccac attactagat taacatgtgg 225780 
atgctgtgat aattttagat gtcaaatgga 225840 
aaagcatcat ttttggtgtt tctgcgagga 225900 
gtggactgag tgaggaagat ccctccttac 225960 
gcccagatgg aataagatag aggaaggaat 226020 
ttactcttca ttccctgctc ttggacaaca 226080 
actagcaccc cacttcccat agccgagttc 226140 
accattggct tccagggcca cactactcca 226200 
tctggaggct tttcagtctc tataatcacg 226260 
tatctatcta tctatctatc tatctatcta 226320 
catctatgta tctattatct atatctatcc 226380 
tacagatgcc agcaaaccaa aaccaagtgg 226440 
ccttaatgac tcttagtaac caaaaacaca 226500 
aaaggtaaaa tattgtatct gctagttcac 226560 
tattcaaata ctgatttaga aaatgacagc 226620 
ttagaagaaa aagaaatgaa attttaatga 226680 
ataagatgtg actggctgaa tattgttatg 226740 
aatattttgt cattgaaaaa tgacctatta 226800 
ttagataaag gagtttataa aggcttctgc 226860 
attgatttat cttgctttgt ttctgagcat 226920 
aattttttta aatggcagag actctgattt 226980 
gtctatgtga actctttcta tgagtttctt 227040 
aaatggtgtc ccctcaaaat ttatgtcatt 227100 
aaatagggtc attgtggatg tacttagtta 227160 
ctacatccaa tattgctggt gtctttctag 227220 
ccatgtgaag acagaggtag agattgactg 227280 
ttgccggcca ccatcaggaa gtcggagaga 227340 
accagcctgt gtgttaattt cagacttcta 227400 
taaaactgtt ttaagccact cagtttgcaa 227460 
tacaaagccc tagagaattt caataaaggt 227520 
ataggacctt ctagcattct tacacttctg 227580 
tatgctttta ttgtgattgg caaaaatccc 227640 
aacaaagcta tgtacttgtt ttgggaagca 227700 
taaaaggtga ggagaaattt atagcgaaga 227760 
ccactgaatt ctgagaactc tgaacaaaac 227820 
tgttgaatga atggatgaat caacaaataa 227880 
caaaaagcaa cacaaatgag tagacaagag 227940 
cccatttttt cagccaagta ggcaggcatc 228000 
tggcataatg aaatagcaaa acaaagtcct 228060 
agaagaaggg gcatgggaca aatcagagtc 228120 
gcaatacagt tggagtcaag atgctcagaa 228180 
tatctttcac actacccttc catggaaggt 228240 
actggagact cccttttcct atattccttt 228300 
tctattcttg aacagaaaag aacacagtgc 228360 
taagaaaagc tgctattgtc ctagtttgta 228420 
ctgttcagtt ggaaaattag tacaggtttc 228480 
tgcgtactaa gggaagcttc tgtacagacc 228540 
attaaagaaa cacaccataa tcttcctggt 228600 
acatctgtat aaggtacatt tgtgtaaaaa 228660 
ttgttagatg ttgttctaag aaaactcagc 228720 
aaataattgc ctaactggat ggtcacaaat 228780 
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gaccaagaaa ccaaatatat ttcatagtat 
tgcacagtag agatcttttg aaaaatattt 
aaagtcttaa acataacaag ttaaataaat 
aatgttcaag acatagaatc atgccttttg 
aaaaactatg tgcctatttt taggaattaa 
atctcacaaa tcatgttttt ctgtcatatg 
tgtttggctc tgtcacccag gctggacagc 
tcctttctgg gttcaaggga tcctccctcc 
atgcaccaac atatctggct aatttttaaa 
tggttggtct tgaactcctg ggctcaagca 
gattacaagc atgagccact atgcctggtt 
tatgtgactg gcatttgtgt aggattattt 
cagaagtaaa tcactaatta atctgcaccg 
tctatctgat agactgcact gcaggtaagg 
tgggaaaaac aagcaccaag actgaccttc 
tcatttccat ctgttggagg tctcagcttc 
gcctgattga aagaiigtaag ccaccaggtt 
aaagtaagag aagagtgacg ttgttcttcc 
agaaagttgc tgaggattcc tcagaacaat 
tatgaaaacg ctcatgggtg ctgacaggag 
gggctctgta gggggtgaag gaactgtgct 
tgaggcaggg actggaatCg tgctgagact 
tactctaact caatcatatg ccttgggtat 
tggccccaaa tttcatgccc tatttatagc 
aggattcttt ttattatttt ttaaaatacc 
caccgagagg gagaattagg cctgtggacc 
tttttttttt ttttacagat tcatgttttt 
gcttttttgt tgttgttttt attctctata 
agctaattat tgtatcttta catgactcta 
gacatttgga aatgactttc tcctttcagt 
tgagaaattc cagtgcctgg atgttcccag 
gttcaagttt cagatgagaa tttctttttt 
ttaaaatgtc tctgaaagag tctgaaaaaa 
taaaatttta ataaactaat atgagaatga 
tttaatagtc ctctccccta tcataaccca 
tgcattcata aaaatttagt gtgcagttct 
aaatgtgaat aattgaagcc attacacaga 
attccaataa agacactacc aacatcttac 
atattaaacg ttttaaataa aaatgattgg 
aatattcttc accctaaatc acccagatta 
aaatacagcc tttgattaat ttagcggaat 
.cattacagca tgccatggaa tgaaaactta 
tctgtggtca agttcctctc cttttctaaa 
cctctgacaa gggaaccatt ttagcaccaa 
aagatccttt cctgaattaa atttaacctc 
tccacttgag ctatcttttg ggaagagaga 
gaacagttta ataaagcttt gtgagatttc 
tgccctccaa ggctctgcat tcatggatcc 
agtggatata ttaaatraga gtgtggttta 
gtttctaagg ctgacttcac ttctgctttc 
ctttacctca gtgtctcctt ccataaagtg 
atgacagtgc aaattgccat ttagaatcac 
tgttcgttca gaacgggaaa tacaatcttc 
tcttttactt tttaaaagtc atgagatttc 
aggacacttg tgaaaacact ccttcaacta 
ggcttttctc caacttcctc tcttccccaa 
caatcttaac tttttccatt ctggagtatt 
taaatttatg catgtcagat tttgttttag 
ccccaaaccc ttaaactgat taatttcacc 
tatctgactt tattttaaaa tgcaagttga 
tgccctttgc cttgagataa ttttgaagaa 



taactatcat tacagagtgt taaataatac 228840 
taaatgatag attctatttt ggtttgaggc 228900 
ttttaatgaa attttacaag tgaaataatt 228960 
gaaatgactc tcatgatatt ctattatttt 229020 
attgctctgg tagatacccg ataaatatct 229080 
taaatttttt aaatgtttta gtttataggt 229140 
agtgctgtga tctcggctca ctgaatactc 229200 
tcagcttccc aagaagctgg gactacaggc 229260 
ttctttgtag agatggcatc taggtttccc 229320 
gttcccctac tttggccttc caaagtatgg 229380 
agtttctaat ttttacattc agtttatttt 229440 
ttataaccaa tatgttttag gtatgtaacc 229500 
gaggaagtat aatttaggga gagtagaaac 229560 
tgtcacagcc ttgaagcgcc tcagtgactt 229620 
cagatccaag gtctggtagc aggatggttt 229680 
tgacataaag gcctgcttta tacaaaatga 229740 
agagggagaa ctacaggagg cagacaggag 229800 
actagcttgt ggctactaac aaatttcatc 229860 
ttacagagat aactgtaagc tgcagagggc 229920 
cccctggtgg gtggtgggtt gggaatacag 229980 
tgacaccaat caatgggagc ctggagaaac 230040 
caaacgtttt gagttagagt actcaccagg 230100 
aaggaaatta ggaataattg gaactgtggc 230160 
cattcatcaa atttagaatg aggtaaagga 230220 
cacctcattt tttacaccca catcataaat 230280 
tccctttttg tagtacctca tatgtaattt 230340 
tggtacaagt gaacgtttta taacraaaag 230400 
aaaatacctc tgggtgaaaa ttgttaagaa 2304 60 
aatttggtct atttaaaaag tcaaaacata 230520 
actgaagttg agagggagaa ggaaaaaaag 230580 
gctaatttct agacaagtga gttttgataa 230640 
cttctctcat tggttttccc acatcgtgag 230700 
tatatatatt aatgtggaga attatttaca 230760 
gaccctccaa taaatattgt ttagagttat 230820 
tttaaaaatg tgggtatata tatatgtgtg 230880 
tagtttttct ttatgcaaaa cacacaaaat 230940 
catttttcct gctaagttct gtaaatagat 231000 
tttgtcttaa atataattga gagtatttcc 231060 
atttcaaatt gccttaaaga ttcataatgt 231120 
tttctaattg tttctttatt ttgccttcat 231180 
ttatcaatct gagatatttt taccttccca 231240 
ctttgttgct caaatgtatc aaacacagtt 231300 
tttgcttaga ggatctcata aaacgtaact 231360 
cactgcaaaa gcttctgtgt tcctaaggga 231420 
tttagtactc ccatttagcc acctgataaa 231480 
ggtatctggg aacaataaca cttccttttt 231540 
aagatgaaag ataatgtgta atgctgatag 231600 
aattacgttt tttgtcatgg taaaagccac 231660 
agaatgaagg cccaggagtc tggagatctg 231720 
ttatctcact aggggtatgc agcttaacat 231780 
cccagcaaat atctgagaaa ttcttttaaa 231840 
cttggaatgt ctagaaaaaa atgaggtgaa 231900 
tgcttaattt cagatttatc atcaagtttg 231960 
cagaagagga atgacatttt caaagcctta 232020 
ttctttattt ctcaccaaaa ttttgactct 232080 
ttcccaattc caacttgaaa agagtaattg 232140 
ttcttgcagc cgatttgttt catataattg 232200 
atgcacgttt gctggcttta gtcagtcaat 232260 
gtgaaataat tgattacgaa tttgtagcat 232320 
tagaacattg tcatttattg attaatctat 232380 
gcagaatagt gccattagta agacaatatc 2324 40 
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tatgtattac aaatgatcat gttcttaaaa tacacacgca cacacacacg cgcatgcaca 232500 
cactcacact ttctagtaat aagcagtgtt ttgagtaaaa tcttttgaaa gatgtttagc 232560 
aaatcgaaac aatttctgtc ttcctatgtg atgcctgcaa ttatatggtt tagatattta 232620 
ttgaaggact tgggaaacct catagtccca ttggcattat tctcaacata tcaaagattc 232680 
attcagtata aataaataat ttcattactt ataataataa aagttatcat ggaaaatttt 232740 
ataacctagt aaataggatg tcatagaaat attcaacacg gataaataaa atatatacat 232800 
agtttaaatt agaacacctg'ttatagaatg gtacagtcta actctctggc caggtaagct 232860 
tctagcatga aacttcacac aatcatacct acaatatgtt cttgcctaat t'ccttgttcc 232920 
aaaggaaagt ttgataagga agttgaataa tttagcttcc ttttagagat ggtagagctt 232980 
atttaagtag ctttcctgta gtacaaaaag attaggcagc tctgattgga actctagatt 233040 
gaaaatgagt agaaataacc tcttatagaa cacgtctaaa tttaactttg cttcacaccg 233100 
ataaactata gcctctacta ctttatagac agattatagc tcatcagtag gactaaaaat 233160 
cactgtggaa ttactctgca atagtcctgt tgctatgtct ataaatataa tttcagtgag 233220 
tgacttttcc attttctagc tgcttggcta tacttttttt gcgtggcaaa ctaaagggct 233280 
ttaggaatat ggtaaatgac aaatttaaat agcctcagat aaaacatgaa agtcaacatt 233340 
aaattatttg gttaaacatt attttcagaa caattttaaa aaataatatt ttgcaactga 233400 
ttttaactta atgtttatac tattctaaaa aggtaaaaaa aatgcaatgt tgtttattct 2334 60 
cgttaagcct aacatgaaaa taaaagcatt atttagggga aaaagtccca tttaattatg 233520 
tgaacacata attaaataat tatcataaga ggcaaaaaaa gatagatagc atcagatatt 233580 
gtagcttggt agagcttcag tactaggggg tcaaacagtt gaaaataatc tgggaaacag 233640 
ttttaagtgt tttatattta aaacacctac acacacaaac acacacacac acacacacac 233700 
actctcacgt aggcagtgtg aaatttacat tgttcctttc ccatgttatt gttgtgtatg 2337 60 
ggcaggctta tttttcacat atttccctct caatagcctt ggggttctaa agagagacac 233820 
aacattatca cccaagatgc ttgacaaaga tgggtatttt gagggtgatc cagatttact 233880 
gatgatatga ggcctggaaa tctgaaaatt taacaaactt tcctagtaat ttttataaaa 233940 
aacaaaattc aagaaaaact actgcatgaa aaaatgtgaa gaagccagtg gatactttta 234000 
aaaaagtgtt aaaatggttc ctatgtaata gacaaataat ctctgagcta gataaaagtt 234060 
caacatgaag tataatctaa tattaaaaag cccagagaag ttgctttcgt aagttccaac 234120 
attagcattt agctttctcc tggttaaaag tttaaaatga agaagataca tactctgcat 234180 
agggctgagg aaggtaagag acccaaggac ataatgatct ctgtaaatgc ctctgtgcca 234 2 40 
tttccaaata gccgtcctca tgccttctct tccatccttc tttccttgtt accgtctctc 234300 
ttccttcttc tgtctccttt gctgcaacag tatttaagtg gatatataat acatattaac 2343 60 
tctatcacat cttttaacat acatgttggt ttaatgactg tggagcgatt gtaggaattg 234420 
tccattttac ttaatttttc agaggttttt ggttaagaca aaattccctc .agagactcaa 234480 
atgtcatctg ttgattgaaa tatacatctg tgttttgaga ttaagcccaa acgaagggtg 234540 
gcttctctct ctggatcgga ctatactttg agtcatttag ttacagtaaa acaaaaatat 234600 
caaaatggca cgaaaaaatt aactgtggtg cctaattctt tgttcctcat gaaatggatt 234660 
gcctatacca tacttaaata acatgtgcat agcctttcca aagctccaaa cctcacgatg 234720 
aaatagtcct ctcgtccctt ccctgtgtga gtgttgggaa gattcctttt ccctgtcaag 234780 
gggcctttcc tccggtgttg accacaaggt agttttttga ggggatctta agttgtatta 234840 
gtattttatg agttcttaga tagggttaaa taattcctct tactttggcg tggaaagaga 234 900 
atggagtgtg atagtgaatg tgggaaaaag ttgtgaaaga ctgtgaaatg agctgtaaag 234 960 
attactaaat cacctgacta gcagagtctt ccaagatttc tctgcctttc atcatctttg 235020 
agctattttg caactttgta aaattttaaa aatatgatat aaatgcaatt gactttcttc 235080 
cagaaaaatg caaagagagt gtccagtgaa aatgctaagg attattgctc tgtgaatcga 23514 0 
ccagttttac ctgtagatga ttttaaattt gcctttgaat tggttgtaac atcttattgt 235200 
ttctcttatt aatcaagtaa aatctttgat tcatgttcat gatacatctg tatgagctta 2352 60 
ccatgtcttt aaatgcctta tcatgttaac ataacctact ttctagacaa atagsactcc 235320 
ctttcttatt ttcaagtaag tagcaaacac ttatttgaaa ttctgagccc atcaatttat 235380 
tttttatatg aaaggatgaa catttcctag aagccctctt ggtatcatct aggtaaactt 235440 
catatcaaat ctgtgctaat atgttatttg tcctggaact ggttcttagt ccaaatttta 235500 
taccatttat tcgactctta tatccttcac cttgatcaat ataaagtgca tttgccacac 2355 60 
attcggtttg accaatatta ttctctgtat agtggcttta actcaattat caaactgtat 235620 
tttaaaaatt gaatggccag gtgtggtggc tcacatcttc tataatccca gcactttggg 235680 
aggccaaggg gggcggatca cctgaggtca ggagttggag accagcctgg ccaacatggt 2357 40 
gaaaccctgt ctctactaag aatacaaaaa aattggccgg gcacggtggc tcatgcctgt 2358 00 
aatcccagca ttttgggagg ccgaggcggg cggatcacaa ggtcaggaga tcaagaccat 2358 60 
catggctaac acgatgaaac cccatctgta ctaaaaacaa aaaattagcc aggcttggtg 235920 
gtgggcgcct gtagtcccag ctactcggga ggttgaggca ggagaatggc atgaacccgg 235980 
gaggcggagc ttgcagtgag ctgagatcgt gtcactgcac tcgagcctgg gccacagagc 236040 
gtgccatcaa aaaagaaaaa agaaaaaaga aaagaaaaca aattagctag ttgtggtggt 236100 
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gtgtgcctgt agtct.cagct gcttgggagg 
cgcagaggtt gcagtgagcc cagatggcac 
gactctgtct caaaataatt taaaataaac 
gtgtacatgg gtgagctgct ttattttatg 
taccgtctag cttctataat ttataaatat 
tgttgtttgg cattttgtta gttggcttaa 
agggcatttt aaaattatta gccatggggg 
gccagcatgg ctacacagag aaattccagt 
atatgaggac agtgtgctta atacataaca 
ttgttattcc ttaagcagtt atttttaatc 
aagaatataa ctctagtctc ttcttgcaaa 
ttaatccagt aaagaaacat gtccagcctg 
ttaggaagat taaccaagta aaaatccatt 
aatcatccaa gaaataaaaa actaaaaata 
agtgattctc tttgcattca atagagactg 
tgctttggaa agtttggaaa tatgatgaaa 
ttcagttggt tccccacaca tcacaaaatt 
aatttagggc ttttattagt gctatgacag 
gaatttactt taattatttc agagtaggga 
gagtttgtaa aatctgtaaa ctctcacaac 
gtcccctata ataagctcat tacttcataa 
aatgtttaaa attttcacca ctggtacatt 
gtgaatacct gaaagaaata aaataggatt 
tcattattag aaaataaagc ccaaataaaa 
gaaaaatttg caagttgaag aggagaaaat 
tggttaagac agtgtggatg gtgataagag 
tttgaattta gtgttctgcc tagtatgaat 
tatctaatga gatgataact ctctcctacc 
tgcttatata catgctgaaa catactgatg 
ttaaagtaaa gattgatttg ataaaaaata 
acatttcatg ttcatgtcta tttacagcta 
ccaagatctc gcttcttttc atgtctggct 
atattttact gtgattccac aaaccagaga 
catggccatc attagaagac aataagcttg 
aggctaagtg gaagaaatta acatactgca 
tttgcttctc aaattattcc accaggcatt 
gtcactctca gctaacattt gtaatttttt 
aatataatta tcctctttaa taaaagtaaa 
gtaatcatca atcttggtca gaaattaaaa 
agactgacca tctgcacctt tgaggcaaaa 
atcaatgttt taattatgct atcaagggct 
cccaaaaatg tccagttctt aatcctgaaa 
tcatagatgt aattaatgtt aacgaccggg 
ggtccaaggt agtcacaggc atctttataa 
agggggatgc tggaagaaga gtggtcagag 
gcaggccacg agccaaggca tgcaggaggc 
actcagccct ggagcctcca gaaggcatgc 
taagatgtct gtcagcctga tgagaactat 
actaaatttg tgcttatttg ttatagcagt 
gcatgtcaga ttttaagtta tctgctatag 
ttgtctcttt gaaaaaaaaa aaaaaaaaaa 
gggctgggtg ttgtggctga cacctgtaat 
gcactccagc ctgggcaaca tagtgagacc 
aagaattaaa cgttgacaga ttattaagtg 
cgcctataga tatttaaaag ttgatccagg 
aattcccact tcacaaaaat aaatcatgga 
aaaaagagat atcgatctct -tcataagtta 
ctgtaagtgt gaatggttgc taacactcca 
attttcttcc tgtatcttac aaattaattg 
agtattatgt agctttacca cagttttatc 
agataaagag taacatacca atagagttta 



ctgaggcagg agaattgctt gaacccagga 236160 
cactgcactc cagcctgggc aacggagcaa 236220 
aaataaataa aaatagaata ttttggttgg 236280 
tatgtagaat ttctaaaata attgtgcttt 236340 
gtaataaata gtagataatg ttaatgagtg 236400 
aaatgagaat gataaaaaag gcatggtatc 2364 60 
taaaaaagaa aaaggaaagt gagacaagtt 236520 
ttccccaaaa ctggaaaaga atagcatttt 236580 
aagatagctg ctatatatta gaaattatgt 236640 
ttgatagttt attctggtta gatatataac 236700 
aaaaaaaaaa aattccaatg agcaagatga 236760 
gggacaaaaa aaaagtagat caaattttgg 236820 
agctggaatt tattgtaaaa gccaataaga 236880 
gccctcaata tggaataaaa aaagacatag 236940 
agaaaacaca actgtcagaa tattaaatat 237000 
gtaaattaga caattgctag aaaajatatcc 237060 
cagcttgata ccattttctt tttcttgaca 237120 
aaagaaggaa. tgctaccaaa aagaccattt 237180 
aacgataatt ggaaagtagt ttataacctt 23724 0 
gatgtgaagg cagagaaagt atcaaactca 237 300 
aatatatatc cccttcatct atgtgtcaga 237360 
atttttgggc cttacatttt tgtggatttg 237420 
ggataatgtg aatttctttt acactcattg 237480 
tatgacagag cagaattagt tttaatacag 23754 0 
gcagccagga actaacggta cacacctagg 237 600 
aacataagga aaattctgcc ctccagctac 237 660 
attattgaac cctatggtat ggtacttaac 237720 
tgtagaaaat gtagatgcct tgtctacttc 237780 
tctgctgtct tcttaatttt acactaattt 237840 
tgcacttgga aaaattat-tc tgctattccc 237 900 
acaagactag cacgcaaata gatatatgct 237960 
tctccctgtt tacccacaat tattttcata 238020 
caaatctctg aagggaatag tgttccaagg 238080 
ttttccttcc aacattctta gctttgttga 238140 
tgcctttcaa tgaaaattgt ttctaattat 238200 
gattagactt tttaaattca tttcatatct 238260 
aattgttgtt aaagagaatc tattaattga 238320 
taaaattact tgggtagcca acatattcaa 238380 
gtaagatgag catctaaact tggcactcaa 2384 40 
taaaattcag tttaggaaaa ccttatgaaa 238500 
gtggtaggca gaaacttctt tgctccaccc 238560 
acctgtggaa tcttgtgtaa caaagggact 238620 
aggtggagag tgacctcggt tatcctggtg 238680 
tcagtgaagg tctccctgct gtgcttagag 238740 
aagcaacatt gctgactcya aagacagagg 238800 
ctccagaagc tggggcaggc aagagaatgg 2388 60 
agccctgccc attccttgct tttagccagg 238 920 
gaaataatga atgtgcttgt ttttttcacc 238980 
gatagaaaac taagacaagg ctatatcttg 239040 
tttagaaaag cccanagaag gagctgcaat 239100 
ggatcacctt ctaggggatt aaacattaat 239160 
cctagcactt ggggaggccg aggcgctgct 239220 
ctgtctccag aaaaaaaaaa aaaaaagaaa 239280 
aggcactgtg tgagacctag cttgggtttg 239340 
ttttcacttg aaagcagaac tactgcctta 239400 
ttgagggaaa aatttacttg tttgctttat 2394 60 
tatatgtttt acaaggttgc actgtttgaa 239520 
gaaaccaaga ggtacatttt aactggaaaa 239580 
aagtttgtct acgcttgaaa atatgtcaaa 239640 
taacttctag aagaatttaa attcatataa 239700 
gagagttttc agatttgctc agaagaatgc 2397 60 
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tcytttgaaa acctatgaag tagattttac 
aaaatatcag tcgtgttcag attcctttat 
atcaatggag catataacat gtatggttta 
agccaaatac gtcttatttt tttattttta 
ttttataaat tgctcagtca tacagttggt 
cttactggct ctcaaacctt^tgctttttaa 
actgcaaaat aagcacaaag ttatcccagt 
caaacaatta gcacaggcag atggagtgag 
tctgcctgct gtggattaaa gattcaaact 
gcctatgtct cctcctcatt aacatgagtg 
gaatatggga gaaagtaagg ctgaaccata 
ttccacttcc tgtctccagg aacattcacc 
acattgcact gaggccgcca tgctgtaaaa 
gagaatcagc acccatcagc ttttccagca 
ttgtgaagca gagatgagac ctcctgccat 
atcatataca ctagtaagct tttactttaa 
gcaaaagatc atcagaacac tgccatgcag 
gtatccaatg actactcaca cacacattac 
cagcaaattt taagaaatga aaatgttatt 
tgtagttaat ctctttgtaa atggtaaaat 
agaagtttgg ttctttaaat gaaagaagtt 
cattctagac atgaaaga'at gttgaagaca 
actgttgaag tttggttttt tgtggatact 
acatcttgtt caggtttcta tgtttttaaa 
tttttaggat atatatctgt atatctgtgt 
ctacatattt tgccaggaga gactggataa 
gtttatagca tccaaatgag caattgctga 
gtctgatctg gcctgagggc aacactctta 
ttggctccag gccacaataa agagccctga 
acattcatgg gaatatgagg atgacttgtg 
tggtgtaggg taagctcagc tcatctctag 
tctgtgcagg tggcacagga cagccagaga 
tggactctcc actgagggag gtagacaagg 
cttactcagg gagactgaga ttttgctcta 
aatttttcac aggacatgac tgccatagcc 
rtgatagcat cataagataa ttttgaatat 
tatccaatat ggtaacccct agcaacctaa 
tagttcctca gtgccactag ccatatttca 
catgttggac agcactgaca tagaccattt 
agtccacctc tggaaccaat ccggagtctt 
teagagcagt actctgctta atgcttagtt 
agagaacttt aaaggcagct agttcgtagc 
tcaggaactt ggtgttctag tggtgacttc 
accactcatt tatccattca gaaagtattt 
ttcagacact gaggcactgt ggggaacaaa 
acattccatg tggtgatgca gactgtaaaa 
aggaagtttt atgaagaata tttcaaaagg 
tgtttgcttg agaatgaact gttcatacaa 
tttattgagt acctgttatt agcaggctac 
agcacaatta aaacacaata acatttgtta 
taatgaatca gagtctcgag gttatggtcc 
taattttaag gtatagccag aatagagagg 
ctagacaatg agaaatcatt aaaatgaagt 
aaaattaata atagcttgga gaggaaagaa 
ttaaatgtat ttagataata gttacagagg 
aaagaataaa aataaacaaa aggacttaaa 
acccagaaat tttcaacctc atgtctaaga 
aaccaaaaat actacgagaa gtaggttgtg 
ctttttcttt attttggatg catgcttttc 
caataaagaa aaacatgatt ttatgccaga 
atataatttt agcctcccac ataatccaaa 



ccaatgtata tctggatcta aaaattaaag 239820 
accctgcaga agagagaaag aataagcaac 239880 
gcactttcat atttttaatc ttaactattc 239940 
tagatgaaga tattgaggtt tgcatttttt 240000 
aaatagctga caggagagtt cactccaggt 240060 
aaatctccaa actagcgcct ctccctttta 240120 
tctacacagt gctaaacaaa ttcctgtgag 240180 
ggggaggtgg tttataacaa tggaaagctg 24024O 
ctttgatcct cctcccaggg ataggtgagg 240300 
atccttggct aggttctcct ttgactgaca 240360 
tactgggctc aggcttccaa ccctgacatg 240420 
ctgggagctc tgtgctgcca tgcaaatgtc 240480 
aaccccaagc tacctacctg aaaagctgga 240540 
ttttgggtcc tcccagctaa atcatcggca 240600 
gccctttcca aatcctggac caacagaatc 240660 
gccactaatt tggggtgtag ctcttttgca 24072O 
gagctgataa acttgctcac tgttatgtat 240780 
ttgcttcact ctactttctg gctttccatt 240840 
cgtgggaaat taactcaaga tcaatattgt 240 900 
attaaattat tgtgagtcat ttttatttaa 240960 
tgggtctttt gtggatactc tgcatataga 241020 
aagaatgttg aagaaatata gacatgaaag 241080 
ctgcatatag acattctaga ctggagtgat 241140 
atgataactt gacacgccta ccaatgccaa 241200 
gaactaagga aaattcagga atgattaaac 241260 
tgtaagtctt tatatttttc acaacttcaa 241320 
catacctagg ggatttctct gcaggttcaa 241380 
ctgagtgtgt tcctaggcac tccttggcag 241440 
tgattctctg agcaggccca acttccattc 241500 
atcttcagga cttagataga ctctttcagt 241560 
agacttccca cagtggagct catgcatgac 241620' 
caggcaccta aggagtgcac taccagcagg 241680 
gagaaagcca ggtgtccaca tggagcaggc 241740 
cacatcataa aacgtggagg tcattaatgg 241800 
atgatagggt ccctggctaa tctttcaatt 241860. 
gaggagagaa gaataactct agaactgcaa 241920 
ggctacttaa actaattaaa attaaaactt 241980 
agtactcaat aaccacacac acttggctac 242040 
ccattactgc agaaagtcct cctatttgac 242100 
tagggaccta aaagactgag gtgctaaatc 242160 
aaaagacttg ctaatctata tagttgatct 242220 
aactatgttg ctggaccttg aaactgtatg 242280 
tcttttatct accatgttca cacttttcct 24234 0 
cagaagtccc tatgatttac caggtgctat 242400 
acaaacaaag tctcctttct cagggatctc 242460 
attcacacac agaagatact ttcatacagt 242520 
ggtgtggggt gggagaggtt gatgttcagg 242580 
agaatcaata attcatttgt tccacaaata 24264 0 
tttattaggt tgccagagca gggttttctt 242700 
gaaattcaaa gacccatgtc cactcttgca 242760 
agagactggt gtatttaaaa atcactgaag 242820 
acaccggcat tcttcctgaa actcaacaca 242880 
aaatattttg attgagattg ttttggcttc 242940 
ggtaaattat aaaagacaat taagaatctc 243000 
gacaaaaatg agtaggatct gtgataacag 243060 
aaagtaagaa taagaaagaa gtcaacagta 243120 
gaatgatgat gctatttagc aatatatcca 243180 
ttcacttgtg atgttgtaaa tttataatta 243240 
ctttttctga agttctgaga catacaaaaa 243300 
aaataagata ggaacaattc ctcatgctca 243360 
atatatgtgc agcaaattgt gggagtttcg 243420 
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aaattagatc caacttctta ttacagttaa 
ttaggatctt ttttggaact tgcagtataa 
tcaaagtcca ataaaacagt aaactggact 
tctgggaatt gagccagact tttcaaatta 
ttaactaaca ttattaaata aaagcacata 
gttaatgtta acaaatgttg_aagatgtgtg 
aagcagataa gatattctct tgagcatgat 
aattttatgc aatccccttg gctgtgggct 
aaaaaaatgc catttacaca gttgcctaaa 
agtttttttt tttctttcgc ctccttttct 
tttgcaaaaa caaacgtagt ttaaattgta 
ttatatattt atggaaacat ttgtgtgtgt 
gaccctgttt taagtcatat ttcatttgct 
ggcattcact ttttgattca ttattaaggc 
tttcctgatc agagtagacg tttaggccac 
tgaaatattt gatctctcct tacccattct 
tt&gtgtggg gtcctgatct tcccagtgac 
ga'tgagttca gacaagaaga aggtgatgat 
ctgaggaagg aatagtttga ggctggagag 
ggtatttatt cttgaaagac agtgtaaagt 
gtagaaatac ataccatgac gtgcttcaac 
tgtgcatctt gaagaaaatt ccaaaccctc 
agaaattcag agggttttaa ataagaagca 
ccagcagtca aaaaaccagt gaaaagatat 
ttatctgaag ttcttgactt ttaaagtcca 
acattaaatg ttaaagaaaa acataagaaa 
attagtattt tgttaccaac accaaagaca 
gtttaagtga ggaaaaaaaa aaacacgtta 
tttacttttg gcagtgttat ttgctaatca 
atgtattata aaggttgtat cttctcattt 
tgtacactat aaatgcagca agaccaggtc 
ttttcttgtg tgtagggctc tccctcccac 
cccctggtct gtgcttgtat aatatacaag 
tccttcatta tttttaaatg gttataatat 
tagattctaa tattatctta ctctctgaaa 
ttgtctatgc tgatgcaaca tatgggcagg 
atatcgttgt acacagattc attcatttct 
aattaatcac tcgtaagtgg ctnigtaattt 
acjtgctttag tgagcatcta gccatttctg 
ttcaccacct tcttcttcca caaatagact 
aatgttacaa ataacatttt taacgcaata 
tgcccagact taagccagcc aaatgtgtgg 
aagataggga acattcataa caaatrtaaa 
cataccatct ctccaaatct ggttttaaag 
cttctggaat tgagatttat ggaaaagaga 
aaggtctgga ggagcctcca ggcacaggca 
gttctgaaaa tgctgcttta gcaagcacac 
gattaatagt taccattttt ttcttctctt 
ttgctctgct tacactgaaa tctcctgctg 
ggagagtgaa acttagtgaa gaaatgattt 
acagttatct aaacagaatg agtctaaagt 
tgctgcttat aattgcagag gaacattttg 
actttttata aaacataacc ttttaagaga 
aaaaaagtaa ctgaacagca agggattgta 
caaatagtat ctcaaaattg atcaggttct 
caagtgatgg aagaggttgg aagtgaataa 
tgggggtagt gtctggagct gagcatgatg 
ccagtatttc gcaaatcata tgtcacgtca 
cctacgccag gtttggtgcc agctctaatt 
aatgcacatg agtgcacatt ttcatatata 
aatctttcct acttagaagt aataaaaaac 



agataaaaag aaaacaaggc tttgtctttt 243480 
aaatactctt tcaaaaaaca taactcaaca 243540 
aaatatattc gtatattagc ttgacctaag 243600 
tactatcatt atttacagaa aaaggaaatg 243660 
tacacatgcr taacaggaaa caaaatgaat 243720 
tgtgtgtgtg tgtgtatgtg tgtgtataca 243780 
tggattggaa atagtagcta tgccagggga 24384 0 
acacatttat tgtgttatac aaatatgcca 243900 
acaaaattga aattatacat tagtgtttaa 243960 
taatgaatgg cagataatga aatttaggag 244020 
ccttagttgt ctatgaaata caaatacatt 244080 
tcagaaagtc catattttgc attcaagatt 244140 
acactcaaat tgactaaagg cagatacttt 244200 
attattttga ■ ttgattctgc ttaaggtagt 244260 
ctatctagac aaatatccca ctttcctatc 24 4320 
agggttttgt gcatctctaa gatgatcatc 244380 
ttagcagcac agtaatttgg ctgaaatgtt 244440 
gacaagagag cgtcacgggg aaattctgcc 244500 
atgtgctacc cccgccactc tttctcaaat 244560 
gtctgaaatt gcagaagcaa agtggttgct 24 4620 
tcgtttcctt aaaggttaaa gaaagaaaaa 24 4 680 
attgtgagga gtaatccaat atttcaattt 244740 
attattctag gcaaaaagca aatataaatg 24 4800 
gattaaaact gagatgtaaa aaatagatat 24 48 60 
caggcatttt cccttaagaa aagaataaaa 24 4 920 
tattataaaa aagataatta ttcagaaaat 244980 
aacagagtat ttttcaatga ctggaagcca 24 504 0 
ttattactgc tattatccat tatttttatt 245100 
gctaacgcat ggtagtggac ttgatctttt 24 5160 
taagggaata ttttaactac attgtccaat 24 5220 
aagtgaaact tcagtgggcg agtctctttc 24 5280 
ctgctacctt gactttttag attccctgtt 24 5340 
ctgtccttga catagcacgt tcctcttttt 245400 
actcttaaaa acaaatttga atattactaa 24 54 60 
ttaagtagtg atgtatattg tgttaaacgc 24 5520 
tctctctctg ggcctatatg tatttgtttc 245580 
ttaaatgggt atataatatt tcagtgtata 24 5640 
aatttgttgc tacttcctcc caagatgaac 24 5700 
tatctgtcta cttattgatt tatgcatata 245760 
acacatcctc ttcatgagag tctacacaaa 245820 
gttcccttaa gctcacactt gtaatgtttc 24 5880 
aaataaattc catttgtgta ttaaatatca 245940 
satagccaag gattcattga aacatttcca 24 6000 
agattgtaac ataatttcag gaataaagac 24 6060 
ttatagctcc acaagattct agggatgcaa 24 6120 
ctttgtttcc atgctgaaga ttcaatgtgg 24 6180 
atttgttcca ctggtgttgg attttctatt 24 6240 
aggaaagaca acctgtaggc acagaaatca 24 6300 
actttgagag ctgagcattg gattttgatg 24 6360 
gaacctttgt tgatctaata cttctgtgat 24 6420 
cagtataaag acctgtgtct ctccatgttt 24 6480 
agatagaaaa ttttatccct ctttcaagtg 24 6540 
ggtcattctt ttgaaaaatt gtctgaatac 24 6600 
gccataatgt tcttgaccac tttagattga 24 6660 
ggagaatcta ttgaggttgt atggagtaac 24 6720 
ttttatgtgg tattatgtga ttttttttgg 24 6780 
gagcctagat ggggcagcct tctaatgctc 24 684 0 
aaattgtaat gacaaagctc acagcagctg 24 6900 
agagcattga tcatttctct ccccttcatg 24 6960 
agggcaaagg acatttcaaa gtttggatgg 247020 
aatactttga aatcttacca aatgttaaac 247080 
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aaatagcaac aatttttcca tttgaagatg 
tttctatttt ggacatggaa actacagatg 
tcctggctga tctcttttac ttaattcttc 
tttattggta atgcttaaat aacaaaagtt 
tttatactta atatctaatt taacgcttgt 
tctaaaatga cttgttcagg^gctccttttg 
aatattgtta ccatgctcat ccaataacac 
aattttgtag ttgaggagca ttgaaaatat 
caaaatgcat tttatcataa tttaaaattt 
ctgataaatg tagttttatt tgtttgcttg 
ataccactct gtctttatga ctcatcggtt 
ttttaaataa gtttcagaat taatacattt 
ttaaaatgtg tttgtaacaa gagtcagcac 
ttayaaaact tataagcaga gtttactctg 
tacaacatat taagtattta tcccatgcaa 
tgatcaaaac gatttacagg tttcaagaac 
gggaaaatga taatagtgaa ataattacat 
taattattct gaaggaaaga caectacatc 
gaagcctttc caaataattg ttgatgggat 
ctgggcttgg tatgggggag atacaactgt 
gaacttct'tc aatggagaga agttcctgag 
gtgatgctat agaagccaac cacgtggcta 
agggggaggc attagaggga tttagtgtgg 
cttgcaactc tctctcaggt tgcagcatgg 
acgcatttcc agtagtccag gagagtggca 
ctggggggaa gaagtggggt aacaacaccc 
aaatatttca aagcaggaat gctattccag 
caaccaaata tcgtgggaga ctgacgaatc 
ggttttaaaa eactttcatg tctatgtaaa 
gcctttggtt atcagatagt cagactttta 
gtctgctttt aatagtagtt ttctcttctt 
tgatactttt ccttgttagt aagcaaggcc 
aacacaaaga atagtgcagc caaacccgga 
tctcactgag tatctggaaa gctcattgtt 
taactcagct atagtcataa tgtaactgac 
caacagagta aatgctgttt ggggtgcaag 
tctcatttaa atgtatttca agttgacatt 
actaaaatct ctgcaatatc taatcaatat 
caaataggat ctgtttatat tttagttaag 
aatatttact ggtaggaatt tggtttaaac 
acaagtaagt ctttaaagac tacaattgga 
ccagcacttt gggaggccaa agtgggtgga 
ggtgaaaacc ccgtctctac tgaaaataca 
gtaatcccag ctactcagga ggctgaggca 
ttgcagtgag ctgagattgc accattgcac 
tctcaanaat aaaaaaaaaa aaaaangaaa 
atcaaattta ggagataaat ggtaaataat 
tatttgaaat gaagaaactg agatcacaca 
tttctttctt tct.ttctctt tctttctttc 
tctgtcaccc aggctggagt gcagtggcac 
aggttcaagt gattctcctg cctcagcctc 
ccacgcctgg ctagcntttt tgtatttttt 
caggatggtc ttgatctcct gacctcgtga 
gacgtgagcc actgcgccca gcctattttc 
attgatacgg aatattttac atatttatgg 
aaggtgtaat tatcatcacg gttttgggga 
atgttttaag aacatttcac attctctctt 
tgttaactat agtcactcta ttctgccatt 
aggtgtggtg gctcacacct gtaatcccag 
ctgaggtcag gcgttcaaaa ccagcctggc 
ctacaaaaat cagctgggcg tggtggtggg 



tgtgtcttaa ggtctgttgg atctttcaaa 247140 
aaatttaact tactcaaagt tacacagaac 247200 
taaaatctaa agtaagaata attttgaaat 247260 
actttaatat gttacatgtt tgaagtttca 247320 
gaaaatatta ttcagtattt ttttttcttt 247380' 
ggattttaag tgagaaaaac tctatttcat 24744 0 
ttcatagatg gttagccatc tctaaaatta 247500 
cacctatatt tggatttctg tttaacactt 247560 
gatttaaata caaacctctt tatagaaaac 247 620 
cttttttttg cagggaggga tattcccttg 247680 
atttataaat gttatcatcg tttttctgat 24774 0 
aactgaataa atcaaccaat aatgtcaaac 247800 
ttataatgaa aaatataatt aattaagagg 247860 
cttacagaag ggcaaacatt tgttatttga 247920 
tcactgtcct aggctttgag tttatagtga 247980 
cctgctcaca tacagcttac aatcctatgt 248040 
taattaatat atacttgatt caaatgtaaa 248100 
aagagagctg gcttaaattc ggggatcaag 248160 
tgaaagacaa ttgtgggatg agatgaccac 24 8220 
cagtgggaaa gccctggaga tggggaagaa 248280 
tgagkagaag gcaggtgacc acagtgtgaa 24 8340 
tgcaagcaga aggaatttat ccaaaaagaa 248400 
agagtggtgg tgacaagatc agatttgcaa 24B460 
gggatgatgg gatgagccca gagtggactg 248520 
tgtggcatga actgtgtgag gaagggggag 24 8580 
tatttgttgt tttgcttctt acactttgaa 248640 
acatttaaca cctggaacag gaataagtac 248700 
agaatgaaca tcagctctgc atggcagtat 248760 
caaagtaaaa atgttaaaag attattttac 248820 
gtagtttata ttttattgca gtctgatgag 24 8880 
cccccaaccc cattacatta aaatggcaaa 24894 0 
aagttaggat tatcagaaag tcaataccta 2 4 9000 
aaatattgtc tgttttgcca agaggaagca 24 9060 
tctcaattat ggttatttta ttagttgcca 249120- 
cagcaccgat gatgaaatca ataaccacta 249180 
aaacatcatt tttatctaag tgaaagtaca 24 924 0 
ttgtatatat tgttaataaa cttcaatttt 249300 
actgaatgat atgctctccc cttttgtgct 24 9360 
aaatgctatg ccattttatt atattagaaa 24 9420 
tctccctgtt attcagctgg gaagtatatt 24 9480 
gaccgggcac ggtggctcat gcctgtaatc 24 954 0 
tcaggagttc gataccagcc tggccaacat 249600 
aaaattagcc aggtgaggtt gcaggcgcct 24 9660 
ggagaatccc ttgaacctgg gaggcagagt 24 9720 
tccagcctgg agaacaagag cgagatttcg 24 9780 
aaaaaagact aaaattggag agtttaacgt 249840 
atgtttgtat atgaaaatat tttaatacaa 24 9900 
actttttgag cgttgagttt atacatttat 24 9960 
tttttttttt tttttgagat ggagtctcac 250020 
agtctcagct cactgcaacc tctgtctcct 250080 
ccgagtagct gggactacag gtgtgtgcca 250140 
tagtagagat agggtttcna ctgtgttagc 250200 
tctgcccatc tcggcctccc aaagtgctgg 250260 
tttcttttac ctatatcatt tcattttttt 250320 
agtacatgtg atattttgtt acatgcatag 250380 
tatatatccc cttcagtatt tagcattttt 250440 
ctcgctactt tgaaatatac aatacattgt 2SO5O0 
gaacattaga acttaaaata ctttccagcc 250560 
cactttggga ggccgaggtg ggcgggtcac 250620 
caacatggtg aaatgtttgt atttgtaaaa . 250680 
cacctgtaat cctaactact caggaggctg 250740 
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aggcaggaga attgcttgaa cctgggaggt ggatgttgta gtgagcccag attgggtcat 250800 
tgcactccag cctgggtgac aagagcaaaa gtctgtctca nagaaaaaaa aaaaaaaaaa 250860 
gaacttatag cttccaacta actctatgtt tgcacccatt gatcatcttt tcttcattcc 250920 
ccactctcag ccatacacac ttcccagcct ctggcatcta tcattctact gtctacctac 250 980 
atgagatcaa catttttaat tcccatgtat gagtgataac atgtggtgtt tgtctttctg 251040 
tgtctggctt attt cactta^ acatagtgac ctccagttcc atttatgttg ctgcaaatga 251100 
cataatttca atgntttttg' tgcacaatta gtattccatt ttgcttatct atttgttaat 251160 
tgatggagat ataggttgat gctatatctt tgcaatggtg aatagtgcta cagtaaatat 251220 
gccagtgctg gtctcccttt ggaacaccga tttattttcc tttggataaa tacctggtag 251280 
tcgaatttct ggatcatatg gtacttctat ttttaatttt gctcagaaat ctccatagta 251340 
tttgttcata atggctgtac taatttacat ttccaccaac agtgtataaa agctcctttt 251400 
tccctgtacc ctcaccagta tctgtcattt ttttgttttt tttagtaata gccattctac 2514 60 
tggggtaagg tgatatcaca tagtggtttt gattcacaat tccctgatag ttagtgatgt 251520 
tgagcatctt ttttatatac ttggtggtta tgccatttgt atgccttttt ttgagaaatg 251580 
tctactcatg tcctctcatg tcctttgcct actttttaat aggattattt gtgagatttg 251640 
tgaggttgtt ttttgtttgt ttgtttgttt gtttgttttc ttacagttga gctatttgag 251700 
tcccttgtat tttctggata ttagtcacct gtcagatgaa tagfettgcaa atattttctc 2517 60 
ccattcaaca ggttgtttgg agattctctt taaaatagct gcaacaaaca aacaaacaaa 2.51820 
caaacccggc caagcacggt. ggctcatgcc tgtaatccca gtacttaggg aggctgaggc 2518 80 
gggcggatca cgacgtcggg agatagagac catcctagct aacacagtga aaccccgtat 251940 
ctactaaaaa tgcaaaaata ttagcctggc gtggtggcgg. gtgcctgtag tcccagctac 252000 
tcgggaagct gaggcaggag aatggcgtga acccgggagg cggagcttgc agtgagccga 252060 
gattgtgcca ctgcactgca gcctgggtga cagagcgaga ctccatccaa aaaaaaaaaa 252120 
aaaaaaaaac cctaggaata aatgtaacta aggaggtaaa atatacctac aagttgcact 252180 
aaaacacact gacaaaagaa attgaagagg gtgcaaacaa atggaaagat atcctatgct 2522 40 
catggatgaa gagacttaat atgattaaaa tgatcgtatc atccgaagca atccacacat 252300 
tcaatgtaat ccctatcgaa ataacattat tttttcacag aaaaaaattt aaaaatcctg 252360 
aagtttgtat agaaccaaaa aagagcgcaa atagccaaaa ctatactgag caaaaaggac 252420 
aaagctttag gcatcacact acctaacttc aaaatacaag gctatagtaa caaaaatagc 25^4 80 
atggtgttgg tataaaaaga cacatagaga ccaatggaac aaaacagaga atctagaaat 252540 
aaaaccccat atttacagct agctgatttt tgacaaaaat gccaagaaca tacattgggg 252600 
aaaggatatc ctctttaaaa aatgtttgca ggaaaattgg atttccatat gcagaaaaat 252660 
gaaactggac ccctacctct caccatacac acaattcaac tcaacatgga ctaaagactt 252720 
aaacgtaaga cctgaagcta taaaaccact agaaaaaaac agggaagaca cttcaggact 252780 
ttgttctagg caaacatttt atgggggaga cctcaaaagc acagaggaca aaaacaaaaa 2528 40 
taaataaatg gggctgtatt aaactacaaa gctgcagagc aaaggaaagt attttcttaa 252900 
tccttactgg ttcttaaaaa gttttggatg ttgaccaaca catctattat gataattaaa 252960 
tgatttggac atgtcaagct tgtcaggata tgcgcaggtt gccg^cttta ggaagagcac 253020 
agtgactctt tctaattcta tttttctatt tcttaaacat agcrfcgactat aaaattgcaa 253080 
agaaatttgt gtatgagcta gggaagatgt tgggttgggt tgagattcaa ggaaggagtt 253140 
ggactttaaa atggtggaat caccagtaga ggacaacacc gtgtaccaaa aacgttcaca 253200 
ggacaagcca ggttgtacac catgatatat gagttgtgat gctgatgaag gacatagtaa 2532 60 
gcttccaata atgggtcaaa aaaaaaaaaa agaaaaaaag aaaagcatta atattaggca 253320 
atgtattttg gtgcctataa tttatattct tatcaatacg gatagctcca tattgatctg 253380 
tttatatata tctatttgtt tatatcattt gattaaaata tggcattgac cattggaaag 2534 40 
gtacatctca aattatcctg gttaaatgtg aaagtatgaa gaggctgtga ttataagcat 253500 
ttaagtttca atcaagtagc aagtgctgtc aagatggtat atccaatcag atattgtgag 253560 
ggacatgtga ttgctataga ttattctgct gtcttaagag ataggctgtg gatattcaga 253620 
ttgtgatttt ggagtttaag agggcaaagt aaattgcctg ctgaactcac agtcatgttt 253680 
gagagagact gacaggaaaa ggtaaaaatg aatcatcatc tacaactggc tggattggac 253740 
ctgacggaat aagatccagc aggaaaggtg tgtgactacc attcctggaa ctgggttttg 253800 
ctgaatcatt ggacagtttt gctgaatcat tgagcataca agttttgagg gatacatgtg 2538 60 
catttagaaa gtaacgaata agaatattca taattatctg ctacttttca tagcccatgt 253920 
attaacacca aagcaatttt agatgagcta tcaataaaaa tattaaaaaa gaaatcacag 253980 
tctatttatt taacacacat tggttatagt caacaaggta atgtgttgtg ctgcaataat 254040 
acaagtaaac atccgtctaa gaatccattg tttagtaggt attactgtac gaatttttaa 254100 
caaagctgct ttaaacacta taccaacagt tttaaataca aaagcccagt ggtcaagctc 254160 
taatataaat gaataaatgt ggccttctgg aagaccctag agagacaaca ggacacagag 254220 
tggagcacag gaaagaaaga agtccacttc ctgcataaat ggcggcggca actctgtttc 254280 
acctgacaga cgcctcctcg aaatgcagtc ccatttaaaa cattttttga ttgttttaga 254340 
taaaacttat aacattgttg aagaagcaac taattctaga tatttttata attttaagct 254400 
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gccagaaaaa 
atgccaatat 
tggtttgttg 
aaagcaaaaa 
gtgtatactt 
atttgaagtt 
tctctttatt 
tggagatatc 
ctggcagagg 
tacaataaca 
gaggccgagg 
tgaaaccccg 
atcccagcta 
cagtgagcca 
caaaaaaaaa 
tagcactttc 
tattttatct 
g.ttgttgcac 
gcaaaataaa 
atttttgtaa 
cctacttact 
ggaacattct 
tatccatttt 
ttactttgga 
cattttcrtt 
tatttctcct 
aataatgatc 
ggcttgaggt 
tttctctgag 
taaaatgaac 
gcaaaatcaa 
atagtccatt 
tttgtctagt 
gtaagctaag 
tcagaaaaca 
ttcattcaaa 
tctctgcatg 
taaagtagtt 
tattttgtag 
catcttaacc 
acatggggta 
gaaaagtttc 
atgtttctga 
gtttattctt 
gcagagccta 
ccgaaagggg 
cttgacaagt 
ttgtgttaaa 
ggtgctttgg 
taataagcat 
tgtgtagcag 
agatgtttgc 
agagaagatt 
cacagtagga 
tttgcaagat 
aacaaatgct 
atggtgggga 
agcactagag 
gagttcaaac 
gataacccat 
gtaatcatta 



gtaagtccac 

tgagaagatt 

gaaacacact 

taccagaatt 

tatatattgc 

tatgcccact _ 

tttcctttct " 

ccatttccca 

aagattccca 

gactattaag 

agtgtggatc 

tctctactaa 

cttggaggac 

agattgtgcc 

aaaaaaaaaa 

tcatatccct 

acctactctc 

agtatgtgtg 

gccattggtt 

ccgaaaaagc 

tgaatttgga 

gtaaaatttc 

gccaaattat 

tattggtttt 

tttcttcttt 

caataattga 

aaggcaaaga 

ccagctggtt 

atgaaaatga 

tcattttgga 

aatgttcagt 

attgatttta 

tctagaactt 

aatatatttt 

gctatctcat 

aaatattggg 

attagagatg 

agtatagctg 

atattaatat 

aagtgaaaca 

tgtcacaggg 

agaacaaaat 

attcattttg 

gcatgcagaa 

actctcattc 

tgaccaaaag 

ttatcgcaat 

ctttatgatg 

aggaaacatt 

atggccacaa 

gccttgcaga 

tatacatgag 

tgtatctctt 

actcactcaa 

atgactttgg 

aaagaaacta 

ggggaggtaa 

atagtgggtg 
cgtggttttt 
gaggactaag 
tgtaaagata 



aattggaaat 
attccagtgc 
aaaccactgt 
ttatgtttac 
cggaattgac 
tttataaata 
tatttcagtc 
acctgctccc 
gggaactatg 
gctggtgcag 
acctgaggtc 
aaatacaaaa' 
tgaggcagga 
actgcactcc 
accacaaaaa 
tgtttgctta 
tgacttcctc 
tgggtaaata 
ataatgaaga 
tcatgatttt 
gtttattccc 
cttgaggaca 
ctgcaagtag 
cttttgaagg 
ttaaatataa 
tatgctacat 
gtgtttcctg 
attaagccgc 
acccaaaaaa 
tgcagaagac 
ggtctctgct 
aatcttgatg 
ccatagaata 
gtatgcattt 
aacaaaagaa 
gaaaggtgtc 
aatcacttaa 
cccccatctc 
tactaatata 
atatatttta 
catagtagac 
taaccagtag 
atttaaatga 
gaatccagat 
ctggacagca 
attgaagaaa 
gtttagaatt 
gatatgggtg 
atcttttgtc 
acttaaatgc 
agatttgtag 
gattttttaa 
gtgttcagct 
tttttgttga 
taataaatgt 
gtttccaggc 
ctacaagatg 
ctattgatgt 
ccacctgtaa 
ttggttaatg 
agctattatt 



ggcccaaggc 
aaaaggtcca 
tccataattc 
aaatgtgagt 
ttatcgaatg 
aaagaatcat 
tacaactact 
tgttcagtta 
aaaggagata 
tggctcacgc 
aggagttcga 
attagccggg 
gaatcacttg 
agcctgggcg 
acaaacaaca 
agaatagtat 
tggtttggag 
gggtcccttt 
ggaaatcaga 
atgtgaatat 
aacactgcag 
aatatccgct 
aaatatcgaa 
acagttatta 
agcaatgtca 
taaaggaaca 
ggaactaatg 
aggaaatgct 
aggcaaaatg 
ctttatgtag 
ttttgtttaa 
caactaaaaa 
ttatgtccaa 
aacattgaaa 
tacaggcatt 
tcattaaaag 
ccttcttaaa 
tgatttgtag 
aaaatgtagt 
tcatattttg 
aactgaaaga 
tattagatta 
atgacagaat 
aatttatgtg 
cagagtgact 
gatcaagcag 
ttgttatatt 
tccttcttaa 
tgccactata 
attatgttgc 
gaaaaatcat 
aaagcagatc 
gaattatagc 
aattattcaa 
agcacactta 
acaaatgact 
ccaccctgaa 
tatctagcaa 
atgaggaaca 
tagagtactt 
aattttaatg 



ccccaggatg 
tgtaactttc 
cttttttttt 
ttttaataga 
ataatttaac 
tttggttgtc 
cccccttctt 
caaggaggga 
gtgactactg 
ctataatccc 
gaccagcctg 
tgtggtggtg 
aattcaggag 
acaagaggga 
acaacaacaa 
ttgaataata 
gggggtgaat 
tctctactca 
ggggaagagg 
tccattggct 
cttacatgca 
ttgttgtatt 
ataagaagct 
aaatagcttg 
tcactttttt 
caaaatggtc 
gttgcctgag 
gcaggccaag 
ggtttttctc 
aaggaatgta 
aatatagtat 
tatcatgcaa 
caattgtttt 
atggttactt 
attttctttt 
atgcgttgag 
gaaactgcct 
taggattatg 
tattactata 
ccatttttgc 
gctcctaata 
taacccaaaa 
acataagtca 
gatgccctgc 
tccttccaaa 
ttatcatgag 
caatcgagga 
tagagaaagt 
aatgtgacta 
caagaaaata 
aaaacaaagt 
tagtagaata 
atctaacaca 
cgaatatttt 
attcattgaa 
tgcttaatag 
ggagtcactg 
gactgagggg 
acgataggac 
agatagctgc 
gtcataattt 



taccttacct 
attaacacta 
ccagttactc 
aacatgcaca 
aagattgttc 
ttcctctctt' 
ctggcccatg 
gggaagacat 
cagagcttgt 
agcacttcag 
gtcaacatgg 
agcatctgta 
gcacaggttg 
aactccgtct 
aacagactat 
taacggccta 
catcagaggt 
tcttttctat 
tctatfcttga 
gcttagcttg 
cca.aggtttg 
ctttgttatt 
ctttagcaat 
taggattact 
ccctgtatta 
ttaattatgc 
aggaggtgat 
atttgtatta 
cactaatggg 
ttccataaaa 
agcacaaata 
cacttgagtt 
gatagctttt 
ctgaaaatat 
ttttctggag 
atttgtggtc 
aatcaatttg 
caaagcagag 
tttactttac 
cacccctgtg 
gtcaaaactg 
tacaaaataa 
atgggggaga 
cttcaaagag 
gagtacagga 
taaagcaaac 
ctttacaaaa 
tcaaaagccg 
aaatagtacg 
tgttggcagg 
acatagtgtg 
tgatttactt 
gcctgtgcca 
attttaggct 
cactaaagga 
cacaaggcac 
tttcatctgg 
ccaactttct 
tggcctcaga 
ctaaaactat 
aaatgaacat 



254460 
254520 
254580 
254540 
254700 
254760 
254820 
254880 
254940 
255000 
255060 
255120 
255180 
255240 
255300 
255360 
255420 
255480 
255540 
255600 
255660 
255720 
255780 
255840 
255900 
255960 
256020 
256080 
256140 
256200 
256260 
256320 
256380 
256440 
256500 
256560 
256620 
2566B0 
256740 
256800 
256860 
256920 
256980 
257040 
257100 
257160 
257220 
257280 
257340 
257400 
257460 
257520 
257580 
257640 
257700 
257760 
257820 
257880 
257940 
258000 
258060 
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tctaggatat agttgcagac attgcaatac 
ctatccagga gagaaattgc aacacagatg 
tattatggtt attaattcaa gacctataaa 
tgtccatatt taaaaacttg cttaaaattc 
gtctcttgtt aaaaaaataa caaatcagtg 
tgaaaagttg atagccaagt aacaaatgaa 
tgcaatggaa aaattaacac "agggatttta 
taattcttat tttgacaata tttaaagcca 
atccttaaat tgctgtatct cagggtacta 
tttcactcac ttgggcttta ttgaagatat 
tattataatg ttgtttctga agaacatacc 
tttatcattt agatttctac ttttagtgta 
aatgacatat acaaaaagtt gaagatgtgt 
atcaaacata tttattattt atcttaggtg 
taaactatat attactttta cttgtcacca 
aatcttttca aattgtgtgg cttcaacttc 
gaccatattc tatcaaggac acaaatactg 
cacagggaag gccttagcct gccttttgga 
tgtacgataa gtagaagagg aagaaataca 
cttattgacc ttaacttcct caatcaacta 
atttttcctt ttttcttatt tatttgtttt 
tttgtcacag atagatttta acagcttgca 
cttctattaa aagattggtg taaagaaatt 
cagaactttg ggaggccgag gtgggcggat 
ctaacacggt gaaaccgcgt ctctactaaa 
cgcgcctgta gtcccagcta ttccggaggc 
tgcagtgagc tgagattggg ccactgcact 
tcaaaaaagt aaataggata attatattct 
ctgacatgaa taagggcaga gacggtagca 
ttcaaggacc tttcattcga gcataggaag 
aggccaaatg tttgaaatcg tgaaagcttc 
gcacaagtga tctacagtga ctttgtggga 
acagtctata acttgtattt gtttcttttc 
gcctatgtct gctatctcct ggaccctccc 
ctacagcctt tcactgtaat aaaccattac 
gagtcctttt agctttcact gaatctaata 
aaaaagaggc tctagagttc gtcagctggg 
gaaatgctga cttccttaca aaaactaaaa 
ctagtttgaa taacacataa caagttttga 
tcatggattg gaggacagtg gcagggccac 
ccagtttaca gacatcagga agttgccact 
aaattcttcc agttaaaaat ttatttatgt 
cccgtggaat attttaagtc tcacagaaaa 
aaaatcagat tatttttctt caaacatcag 
ttatagtgat tgatggagag aagttatttt 
taccagagca gacagctatg agctgtcaga 
ggaccttgaa agctgggaga gaaaggcaca 
cactttcagt ttcactacta gctgccccta 
taaccgattt ttaacccgcg ctgagtcttc 
caatcacgtg tctacactag atcaagtgca 
gtagggtgta aactgtcact aaggaatgtt 
tcactgtact tgaggcagta ttggagaact 
ctggaaaaga gacggctttg tttcctttca 
tctcctctgt ttttctatag aacccagttc 
gtattattaa tgatca-attc atctactcct 
ctccttcatt ctccactatc tacacagact 
aatgccttct tcatatatct tgactagtct 
ctttttcata agagttgtgg attagaaaac 
ttgtttgttt tccttaaaaa attaatactt 
tcccagcact ttgggaggct gaggcgggcg 
tggctaacat ggtgaaaccc cgtctctact 



accaaagtca tttatgttat aatttatatg 258120 
tgcaggcacc ccactatcac cccaaattaa 258180 
atagcatgtg tcatttataa aactctcaaa 258240 
ttaaaaatca aaaagttaat gggttttgaa 258300 
taaatgaatg acagtttaat atcaaaaaga 258360 
attgttaagc agggttgaaa aattaccaag 258420 
tatttgtaat tattgattaa cttcttctat 258480 
aatattggca tatattaaac tgagaaccga 258540 
aaccaaaatt ttctaaaacg ttgaaacaaa 258600 
tattgcttta tcctttagtt agaatgatgt 258660 
ataattaatt aatgttaatt ccccatttct 258720 
tttcatgaag gacatgaaac attactattg 258780 
taatgttctt ttattaggct gatggaagta 258840 
aagtttaaat aataaagtct tgtatactat 258900 
tcttttctct aagcctggtt tagaggtaaa 258960 
caacttgctc accttctctg ggtaagagtt 259020 
tattccttct tcccagggca tttatatagt 259080 
cttccctatg ctttaagatt tttttcttta 25914 0 
tcaaaagaag tttcgctgca attcaactac 259200 
aaagtatctt tctacatcat tcatttcagc 259260 
tcttttttct tttctatcat ccatttgact 259320 
ttgtatgttc attttcctaa aatattagag 259380 
aggggtccag gcggctcatg cctataatcc 2594 40 
cacgaggtca gaagattgag accatccttg 259500 
tatagaaaaa tttagccggg agtggtggca 259560 
tgaggcagga gaatcgcttg aaccaggagt 259620 
ccagcctggg caacagagca agactccatc 259680 
tatattcatt aacatatttt gatggatccc 259740 
agttaagatg ggccagttgg atttcatgcc 259800 
aaaaataaga gtatacatat ttataataca 259860 
cttctgtatc tcaaagagca atcaactctg 259920 
gcagttggca tctgaaaaac atcttaaaaa 259980 
acttacatgt tatgaaagaa gagaaataga 260040 
ctagtacatt tttccattgc taactttart 260100 
taagagtgtg ttagtgtttc taaattcttt 260160 
tttgtgttag gaccccccca cacaccccta 260220 
acattcaagt ggccaggccc aatatattca 260280 
gctttataat tttcaaaaaa aaccccaaac 260340 
gtgaacgctt agcactgtct caccatgagc 260400 
acacgtgccc tccctgtagc caaatgtagg 2604 60 
gcgatgcagt cattcttgac ctctgtgaag 260520 
ctacaagcat cctctttcaa atgcaaccac 260580 
gtcaccttta ttttagcatg aaatggctta 260640 
atataatttc cattaaggaa atagaagtct 260700 
ggcataaaag gaaaatgtgc ttgggtggaa 260760 
gagaagaact tgtggggagc agaggagagg 260820 
aaaacagcaa tgcaagaagt atattttaac 260880 
cagagtgatt gtgattaaag aaactcttac 260940 
catgtatctt agcttgattt tttttgccct 261000 
ccatctacca aatcaaacac tcgtttttaa 261060 
tgggtcaatg tccacctata gatactgctc 261120 
tatgttttct atggaagact tagagcatta 261180 
tattaaatgt caaggagagg cggtaaagcc 261240 
agggctgtgt ctcatcactt actatcctgt 261300 
caccagactg caaagttttt gaggaaagaa 261360 
acgtataaat taacaatgaa taaatgaaca 261420 
atatgttgtt agcattatat ttaaatacca 261480 
tgtgaaggga ttgtttgttt gtttgtttgt 261540 
gcggccgggc gcggcgggtc acgcctgtaa 261600 
gatcacgagg tcaggagatc gagaccatcc 261660 
aaaaatacaa aaaaacatta gccgggcgtg 261720 
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gtggtgggca cctatagtcc cagctactcg ggaggctgag acaggagaat gacgtgaacc 2 61780 
cgggaggtgg aggttgcagt gagccgagat cacgccactg cactccaccc tgggtgacag 2 618 4 0 
accgagactc cgtctcacaa caacaacaaa aaataataat aatactfgga caggaagacc 2 61900 
acagctatca cttctagatc taaacattct tcaagatgag tggagacaga ggtagagtgt 261960 
gtgctggatg gatataggag tggacttact tggttgtcat catacggttg taccatttct 2 62020 
ttttaataat gactgaagat gacattggga ataaataaac atagaccttt gagtttcata 262080 
gccacttaac tccctagttt'tgtttttcaa attagttgag aggtgatact tatagccata 262140 
actttaataa cattaccatc atcatctcaa ttgtggcagc tacaatgtca agaaactcac 262200 
aaagttaaaa tgtgaagaaa ctttagaggt ggtttggtgt ggagaagatt gatgttgata 2 622 60 
cgattcatgt atctttcagg attccatggg attgcactct gtcatcaatt tgctctaagg 2 62320 
actaagtctg aaaaaactta ttgtgtgtca^ ttactgaaat actcagcagt ggtttggttt 262380 
ctgaatactt ttaaagcatt ctctctcttc tccgcattat accataaaat atatcttagc 2 624 4 0 
atattattgt tttcacttgc catttgttta atatatgaga atatattaaa gcatatgcct 262500 
tgaatcaaat gctaaccaga ttaaactata tctaagagag tgtgtctttt caaaacagat 262560 
tgtgaagttg tgttctggtt cccatattac tgccattgtt caatattttg ggacttctat 262620 
tttggaaatg cattcagttc caacttgtag cccctaagaa atgctcccta catctaaagg 2 62680 
ctaagatgcc ctcctttatt atttttctga tcaaatattt aagaaataag cttcaaatat 262740 
ttgttaaatt tttctcaaaa gtaaaatttt tatgtttaaa atatgacaat ctaattgtca 262800 
tctttaatgt attaaaatct tttatatcca gtttttattt tattttattt acttatttat 2 628 60 
ttatttgaga. tagggtctca ctctgtcacc caggctggaa tgcagtggca caatcttggt 262920 
tcactgcaac ctctgcctcc taggcccaag tgatccttcc accttagcct cccgagtaac 262 980 
tgggactaca ggcccacatc actgcgcctg gctaattttt atatttttag tagagacggg 263040 
gtttagccat gttggccagg ctcgtcttga actcctgacc tcagttgatc caccctcctc 263100 
agcctcccaa agtgctggga ttacaggtgt gagccatata ttcagttttt agaagatcat 263160 
tagaattcaa atgttgccag actttaagtg tgggtcacta tctgatggag ggcttctctt 263220 
agaagatgtc cccaatggca gggccttgtt ataaaggttg gaagagagat aatcctctga 263280 
ctaagaggcc agagaagcag aagatttttt ttcatcagat ttcacgtcag taggaagaca 263340 
gaaaggagaa ggaagagaag gcatgaacag catatgtgtg aagtagaatc ttaatagtgt 263400 
gcatctttta agtcaggatt ccgtctactg ttggccaaaa taaatcatta ttatctggtg 2634 60 
gttggaaaaa ataaggacat tccttggttt tctccaaaaa ettattttgg tgtttcttaa 263520 
ttacccttag gagaccttaa tgatcaacac tgtatcaaca gagaaagtgg aattttattc 263580 
tcttatagaa cagtgatgtt attatgcaat gtattcacat aattatttac agttcttcag 263 640 
ttctagatga ctgatactac agaaacctga gataatccct gcttccatta gtttgggtgg 263700 
aacttttgtt gagagacttg agtcatactg aataccttga attactgcca agagctggtg 263760 
atcatcaagc ttgagttaat atcaatgtta tgttgctatg gagaggataa ctgcaggaga 263 820 
acaatcatca tgtggtaggt tttaccacgt ggcctgctga actccatgct caaactccaa 263880 
ctacttttaa gtaatgacat taaaaatcat attttccatt tatctgtgga agctagtgag 263940 
taaagaataa gatagagtct actgaaccta tagaatgtta ctaagtatta ggtgttgtat 264 000 
ttaatttcca gtttgtctaa tgctggaggc aattttacct tctcctacca gtgaaaaagt 264 060 
ttatgccagt aagacttgtg atttttgttc tcccccaaca catgcttcta tataaattta 264120 
ttatgggaaa agtgatatta ataaaaactt tctggaacaa ttatagatag agtgtggtct 264180 
ggtgtgacag atattttgtt tcttcacaac attcattctc tgagtattca ccatggatca 264 24 0 
gatattctgt agtttctgta gtagatacaa aagctttaca aggtgtattt ttccacaaag 264300 
aggttagcta gattaatagt gagaggatac atcaaagaac atggaaatga tttttagaga 264 360 
aatgagagca aatactaaag .tattgaatta agtaaaacaa gaaaaatagc tcctcccacc 264 420 
ttcaaaaaca ttggtgagac caggattgtg gagggcacct tggcaatggg ccttgtagct 264 480 
cagtgaatgg ataaaatgtg tacacagtga gaagagaaga aagtgtctac acagaagtta 264 54 0 
gtttttgttt aatttgatat tttctcttta ectcgttcta gtcaagatga cctgaagcct 264 600 
tctagtgaag aaccagccag gcaagttcca cttcttaggg ccgttaaatc ccttctctta 264 660 
ttaatgatgt taattgcatt tggatctcca tccagatatt tgccctacct taattteatg 264720 
ttaatgtgcc atgggctaaa gaggcagtat tttttaaacg catctgtaag tttcatttat 264780 
gacagaattt ctgtaataaa agtgtagtca agagtgttga tggaaaatta gggtgacata 264840 
aatacgtgag aactggaaaa tgtataatct taaacagtac acattatttt tattatctct 264 900 
ctttcccaca ggctgatgag gcctgggata ctaatatnaa aaaccaaaat tagaaacttc 264 960 
tcctctgtgg tgtttattgc ttcgcctgca gtg'catgaca tcttaattct aaagctctna 265020 
actcacaaaa tctgacatcg ttttaccctc tgacacgtca aaatcagggc ctaattgntc 265080 
ctgtggttgt gaattatgaa caaatatact ttctaagaag atggattttt catttaattt 265140 
tccttttcag cttcacaata tgtacatata cagaggttgc ctgcgatttt aatatgctca 265200 
cacctacaaa aacacacaat acacacaaac tttcaaacgt aagcacacag atctgtaaac 265260 
atacacatga agatttgttg caacacaatt cattctcaaa tctatttcct gttatgctta 265320 
tgattaagat ctataacttc tcagggttga tttatttgtt ttgtacgaga ctgagtaaag 265380 
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ctaaagacca 
taatctgcca 
tataaaaata 
aggttattta 
cacagttatg 
tggctccgtc 
aagcagaaat 
catacatgtt 
tggaccttat 
cttctctcta 
tacctttttt 
gctgacagag 
ttaattatga 
aataattcca 
gtaggacctt 
caaatgtata 
tttctctgta 
tatacataca 
ttgctgttgt 
ttacttttca 
taaaacattt 
gtatcttaag 
ctgatctttg 
catttgggag 
tgtttaaaat 
acacatttcc 
tttgtttcct 
ttagttttga 
taggcatata 
agggtctctc 
ttaacttccc 
actcctggca 
gagtcaag-at 
tactttgtgc 
tccaaaccgt 
ggaaagcgtg 
attttcctac 
tgctgtgtcc 
tgtaatgaag 
tttcaatcca 
ttgagaaacc 
agtttgctac 
aagatgttaa 
cacacacaca 
aacattataa 
taatttaagt 
tatacttact 
tgatcttatt 
tagattcacc 
attaagactg 
tttattctta 
aagtcaaaat 
catagctttg 
gaaatgtggt 
ctgtattttc 
ttcaaggcca 
agtatataaa 
aatctaagat 
gactcaagtt 
ttttatacca 
ttttaataat 



cgaatatgat 
tatttcaaat 
gaaatacatt 
gaggactcat 
tggagaaagt 
ctcgatcact_ 
ctctgtactc 
ttatacagat 
catgctattt 
aaaaatatac 
aggccttgat 
catccaccat 
cataatgaca 
aacttattat 
aatttatagt 
catagtttta 
ccttctacta 
tctgtctctg 
aacagactac 
gtatttctaa 
tatatagaga 
tgatcttgaa 
ctcatttgca 
aaattcacta 
atctatttct 
aaataagtaa 
gtcttctatg 
accataaaaa 
gttgtcagta 
tctgttgccc 
aggctcaagc 
aatttttgta 
gtagatggca 
catgatataa 
acacttacaa 
tgtgtgtgtg 
gagtatccta 
tttccacccc 
cctgagatgt 
caaaacgaaa 
ctgacaccag 
tatttggtga 
aggcatcatt 
cacacggttt 
ttattattaa 
agacacattt 
cagaataaat 
ttgctttggt 
tggaggtctt 
gggtgggctc 
gtagatcttc 
ctctatcgca 
ggacctaaca 
ctgttctctg 
tggccagaaa 
agttccaata 
attgaggatt 
cattctgtta 
gaattttgaa 
tgatttttat 
gtgccaattc 



gggtgttgct 
attccctgtt 
tgtgactttc 
gtaagacaga 
aatctgccac 
tttatatccc 
ttttgacatc 
gtgggcttgt 
tagttatcag 
aaaaatcaat 
ctcacagcct 
ccaaaggcag 
ttaataactt 
tataaccata 
ttaaccattc 
ggggtccatt 
agtaacctgg 
aggactttta 
tgctactaat 
agagcttttt 
ttcatataca 
atcaacatta 
ggagcacctt 
tacaacaatc 
ctattcaaac 
aaacaaaatt 
ttttgattat 
taactcattc 
atctgatttt 
aggctggagt 
gatcgtccca 
ttttttgtag 
cattagttca 
gcaaggtgct 
gtgaggcaac 
tgtgtgtgtg 
gttgatcctc 
acccttccag 
tgttagaatt 
attagatctt 
atctattaat 
tgacaccacc 
ctcttatttc 
ccatagtttt 
ctcctaggaa 
aatgatgcaa 
aatgggaatt 
tttagctttc 
atttcataat 
aagaatttgc 
tcattaaaaa 
ggattcaaca 
ttactcttgt 
aatataccat 
gcccttccac 
aagtttcttc 
gaggaaatcc 
atgttccacg 
tcacaaagtg 
gtaatacatc 
acaaaagaat 



aaaataaact 
gaaatataaa 
agagctgtca 
aaagaactac 
aggtcaccca 
tttcattcca 
ctaactcctt 
cagaaaaatc 
ctgctgagct 
agaactttca 
gatccttaac 
cgtagacaaa 
ttaccttatg 
aacataaata 
acatttgagt 
tttatttctc 
ccttattcca 
atttcatttc 
actactgctg 
aaaaatatgt 
tattgaactt 
taacaacctg 
atataagatg 
tgattctgag 
tttcatttta 
caccaaatct 
ttcttttaaa 
aaagtcatca 
aacccaacag 
gcaatggcac 
agtagctggg 
atgtggggtt 
cgttatttca 
aagtctgtgc 
aaaggtatat 
tgtgtgtatc 
tgccaccatg 
agggatctgg 
taatgaataa 
gacctacaca 
acctttccta 
aactaggatg 
taataattaa 
tttttcaggc 
taatatctac 
gtatggatac 
ccacatggaa 
tgagtaaaca 
gtatccaggg 
atttttcaca 
taagtgtaac 
gagggatgac 
tttacagcag 
tctctttggt 
ctttacagtt 
taagaagagt 
caaaattcca 
tgcttccaac 
tattcattgt 
atttgtgtct 
ttgacttaac 



tctcaagaga 
cataaaggaa 
ccgaatagat 
gcacagatca 
gacagggagt 
aagccaagca 
ctgctaccct 
acaaaatttt 
gttttctatg 
gtgcacttgc 
tcatctactt 
ggttcaatga 
ttttatgttt 
tttgtatcat 
ctttaatccc 
tccacatgta 
ttatatattt 
attggtcaat 
aattactatt 
cattaagtta 
atgtttttat 
ataaaataga 
taaattaggt 
actccaaaga 
tttgccgcat 
tgtatctttt 
tttttaaaat 
aagtaaaagt 
actatataca 
aatcatggct 
gactacaggt 
ttgccataag 
gttaaggttt 
tgcaaacagc 
aggggttcgg 
aggcagaaaa 
ctctttccat 
atgtggatgc 
ggccaagtat 
aaatgtggct 
tggtcacttt 
aatataaatg 
ataataaatg 
aaattcgcta 
atttaaggaa 
ctcatgcaaa 
gcttatgaga 
agcccctaca 
ctttagccct 
aattccaagg 
cctctgcaat 
aatctggccc 
tagttctgcc 
atctttgcac 
tagaaactgg 
gctgcctatt 
gtacatttgt 
ggctccacga 
ccacctgaca 
ctaagagttt 
acttctgagc 



catcaagttt 
ccattaatta 
attggaaggc 
ccgatgatga 
gggaaagtga 
tgttctataa 
catttaccct 
aatgttagaa 
tttgagttcc 
aggtgggttt 
tccaatactg 
gtaggaactc 
ttggagacaa 
taataacagt 
tgctcaattc 
gtgatttatg 
agtttccata 
gtatctgttc 
attattatta 
ctgttttaag 
tttcttgcat 
gtgaaggttt 
aaacgtttaa 
tcgtttttta 
ctaatatctg 
aaaatgtatc 
aaatttgtgt 
aatcatctct 
tatttaagac 
cactacagcc 
gcatgcttcc 
acgaattgtt 
cctgagcctc 
agtcatccta 
gaaaaactac 
ttatcctcac 
gggtgccctt 
ctatgctgag 
gctaacattt 
agctcctctg 
atgttttgaa 
ttcattttca 
tggagataca 
aaatctcata 
agagtaattt 
caaatgcctt 
aggtctttaa 
cactgagtat 
agacatgctg 
tgatattgat 
taacagaata 
aaccaccttg 
attcgtaagt 
tttgatcttg 
ccacttaccc 
tccaaattta 
ttcataatgc 
agtggtatga 
tcttttttta 
ttcacgtgga 
attattcatt 



265440 
265500 
265560 
265620 
265680 
265740 
265800 
265860 
265920 
265980 
266040 
266100 
266160 
266220 
266280 
266340 
266400 
266460 
266520 
266580 
266640 
266700 
266760 
266820 
266880 
266940 
267000 
267060 
267120 
267180 
267240 
267300 
267360 
267420 
267480 
267540 
267600 
267660 
267720 
267780 
267840 
267900 
267960 
268020 
268080 
268140 
268200 
268260 
268320 
268380 
268440 
268500 
268560 
268620 
268680 
268740 
268800 
268860 
268920 
268980 
269040 
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gatacacagt tgaaatacag atagtttaga 
atttttttta aagctcaaca tttcctagag 
ctctcttatt aaaaaatgtg ttaatttaaa 
tgtgtgtgtt ctgtgtgtgt ttgtgtgtgt 
cctaagttta caagattcta- aaccttttca 
ttttggtaga gaaatcttaa tgaagcttaa 
ggttaatctg cttcaaatga tggtggatgc 
gctgcctata gtctccctta aaggttaagg 
tcaggctact gtacatcaga gcactgacaa 
aggacctatt cagaaatgtt ggtttataga 
acagcatggc ttaagtgagg aatttttact 
aatgcctatc ttaactcttg agatgtggcc 
ggttttctca gacatgtgaa tttggggaga 
tgtggcactt gtcagattaa caaaaggagt 
tgatagagtt tcaagttgag aacagtataa 
gggtaagaat aaaaggtgtg tgagccatag 
agcaatagga gctaaaggca ctctaaaagc 
tcttttgttt tcattttaac tttctgcttt 
atttaatggc aagttgcctt taaaatgtat 
tgattttgca taaataatca cagagaatgg 
taattcccat caagagcgtg aaagctacaa 
tcatccagga agtctctttt gttatctaat 
tttttttttt tttttttttt tttttttttt 
ggagtgcagt ggtgtaatct tggctcactg 
tcttgcctca gcctcttgag tagctggatt 
tttttgtatt tttagtagag acggggtttc 
tgacctcata atccaccacc cccctcagcc 
cgctgtgccc agcccttgtt ttgttaggtg 
agtggtggtg aagttgtatt atgttttgaa 
gattaacatg tttttaatca tttatgcact 
tgtaaagaaa gaatacactg catgtaacac 
gcagaagata tttaaacagc ctggattaat 
gactgaatga gttacaaata aacaagttcc 
tgtctcataa ttgtatatgt attgcattta 
acagttataa cgtgcatgag caaaactata 
ggtgaaacag ccactctagt tatgtaaagt 
taggacgaca ttgacatcag catcagttat 
cttaatcaag agagacatga cacctggatt 
aggtggattt ggtgaagtcc aggtggggaa 
gcaatctgct cacagatcca caatattcgg 
tctgtgatgg aatatcaatg cttccatttt 
taaaattaaa aaaatatata tatagccagt 
taaaaatgca aagcatagtt tctgtctttg 
aaatgtagaa gaagcaatta taagcaggat 
tgttgtacaa atcaattctt gtgcctaggc 
tccacattcc aagtaaattt actaattcaa 
cagaaaataa gagtgaaaaa tttcaagaat 
gaaggcttgg attagcaagc atttgattga 
tggagaggtg gggaggagtg agggttatta 
aacacaggtg aaaaatgagc atgaggggat 
ctggaaatgg aagttttgca tggagagcat 
gaaataacag ggctgggttt gaagctgttg 
gactcagaga aaaggggtgt ttggaaaatg 
gcaagattaa actggtagct atcatttcca 
attaaacaaa accaaatgct aacatcagca 
cctgaaattg caaagatctg gtagaataga 
gttgcttctg ctttccatgg ctctccttat 
ctccccattg tcagcctgga acagaaaggt 
catagatgga caagtgcttt aggaaactac 
tctcatttag aacaaacatg gaaaggaagg 
atcaatatac taataattga gactgagtga 



taggaggaaa agtcagtata aaggaatgaa 269100 
attttacttc tcatatctaa tggtgtaatt 269160 
ctattataat tttatcagat gctgatttcg 269220 
gtgagagaga gagatgatct gatgtgtatt 269280 
cttataaata gatagttcag agggtgacat 2 6934 0 
aatgcaagtt tctgtgtaga ggtaccatct 269400 
accatcatta cccaaatggg tgaaagcctg 269460 
agtgatatat gggtgctaaa gaaaggctta 269520 
aacttcctca ggaaatcacc aaatgaaaca 269580 
tatgccatta caatggaact catctaaatg 269640 
cccaggtcag gaaattctaa ggcaggaact 269700 
agaattcagg ttgtttttct actgtgtgct 2 697 60 
taagaacaga atagcatatt agctgcaaca 269B20 
gggaggcttc atacaagtct ccattttccc 269880 
tgttcatatt ttacatcctg atgggaaaat 269940 
aaacagaatc tgtttctttt cactttttat 270000 
tcacttgaag gccacaaggc cttgcttttg 270060 
tttacaaaaa aatctatatt ttgtcccatt 270120 
ctggcctcat ' cacagaaata ctaaggtatg 270180 
catttcttaa gttaattgtg tctttctcac 270240 
tttaggtgat * ctttcagacg attagtgtag 270300 
ctaagtcact ctagctataa ttcccccttg 270360 
gagatggagt cttactctgt tgcctaggct 270420 
caacctccac ctcccgggtt caaacaattc 2704 80 
acatacacgt gccaccatac tcggctactt 270540 
accatgttgg tcaggctggt ctcaaactcc 270600 
ccccaaagtg ctgcgattac aggcatgagc 270660 
agaaaatgag cctgtttcca catggagggc 270720 
aaatactgaa agcaaacagg gagggaagca 270780 
aatattgtat acaatgtgtg tgtgtgcctg 270840 
agtaaggtat tgtatttata tattgctcag 270900 
tcactcatct tgtttgtact aagaaacaaa 270960 
ccaaattgta ttattgtggt ctcatttgag 271020 
tcacaaaagt tatatttgaa cacaaggtgg 271080 
gtgagccata tctagcgatt tagctttcca 271140 
gataggaaat aaatcaatcc catcttagcc 271200 
tccaataact gcataatcaa tatttatggg 271260 
cgctgcatgt gagaagggag ggcaatgtca 271320 
agcagagtaa atcagcagca aagtcaacaa 271380 
ctgctttata gaacacagct ttattctttg 271440 
gggaaaagtc taacgaattc tttagctttt 271500 
ttgtattttg tattttttat aaaaggcata 271560 
agaagttttg aatcttgttg gtgagtcaag 271620 
ttgaatccta cctgtatcac taatttgttg 271680 
caaaataaag caaaaaaaga ctacactggc 271740 
tggtactata aaatgtcaat gaatatattc 271800 
gaaggaaaag aagagtcaga tgtggtgaaa 2718 60 
tattaaaagc aaggataaga atttgctgaa 271920 
tagaaaaaaa atgaagagaa taggaaaaca 271980 
cacctgggat tgagaagatg acatcaactg 272 040 
caggtcagtt tacagaacgc catggacact 272100 
aaaatttcaa tcaggaatcc tttcctatct 272160 
ttcatctgaa gcaggggggc ggggcgggga 272220 
gatatgaccc agtccgtaga atgtttgatt 272280 
ggtccacaac aaacagaaga tttacaatct 272340 
tttcacatg'g caatagttgg cagaagctaa 272400 
ctccacattg ttgtgactga agctggcctg 272460 
gtt.tgcatca gagatcacta ctgtagggac 272520 
taccgaggtc tcatcctgag atgagaaggc 272580 
attgtatgtg agttttgttt tgaaagcaga 272640 
actagtctag gaggcaaagg agaggaacta 272700 
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gcttacacac gaggaagctt aaaaatattg 
taggcatggt ggctcacacc tgtaatttca 
cttgaggcca ggagtttgag accaacgtgg 
aatttaaaag ttagcccaat gtggtggtac 
taagatggga atcacttttt tcctggagtc 
tgtactccag cctggatgac- -aaagtgaaac 
taatactttt gaaagtcctc agatagaaga 
atatgacaaa ttaaaaatat taaatttaag 
gtatacacag gttgacccag aaaagaaaca 
catgcctaca ttacttttga gtacacatat 
aaaatataat atgttcttaa taacttacat 
atatattgaa tgttgactgg tcaagagttc 
atgttagttt ggccaactga aagtaaatat 
tattaatgaa ataatatgtc tgaaggtctg 
tgaacatgaa gagctactaa atgtaactga 
actaggtact agtttagttg catggctgaa 
atgatatgta gctgactgtc aaatcatatg 
taactcttaa gtgaattgtg agtttctgca 
gtactgctgt gcttaattca aaaccaccta 
tacatgatgg gtagatttaa aaaatagttt 
aatcccgaga taggaaga.tt gcttcaaaaa 
tgcttcaatt gtctattaaa agttttgaag 
gcaaggttca aaatcacgtt tccagtaaac 
aacaagtcca caatagttta aacagacatt 
gatttggttc ctgtttattt tctgagttgt 
ttctctttct catgtatcat gtcttatttc 
agcagaataa tctgtatgca cagtgggaga 
gaaaaagaag tttccagcgg ttgatacatt 
agcatatgga gacatgctta aggatcaata 
tgaaaattta tatcttgact gaggatatgt 
tttttggagt gacacatgag aaatggagga 
cttgccaatg attaagccat ccttttgcct 
atcttttaga gtagttgggn ttctatatat 
ctaaaattag tgtaatgagc ttaagttaga 
atatacttgg tttaacgaat ttcttaagat 
aaatataatg catatcttta tttatttgtt 
caaaaaagac attctgctag ccttatgttt 
ttcaaatatt tacccccaag gcttatttta 
agatacctta taatgaagtt tcctcttctt 
gtttctgaat tatcgagtag acctctgaga 
aaccttccaa gatgacattt tccttagaaa 
aaaatcaaac ttctttatga aaaaatatac 
tattaattaa aatgtaatgg taggaatagg 
gagatgacaa gtgcgcacag ctgaggaacg 
aagttcgata tcaccataac cttttgatgg 
ttttattcat actgtaatat tgcttttaaa 
atcaatagaa tcagttaaaa tatacagaat 
ttagaatatt atgttgattt aagacttaat 
ccacaaaaaa acacttttag aagcaaaagt 
aataatattg gtttagttga caaaataaaa 
tctaaaaacg gaaaataagg ctgggcatgg 
gaggttgagg tgggtggagg atcccttgag 
tggcgaaact ccatctctac taaaaataca 
ctgtagttcc agctgctttg gaggctgagg 
tgttgcagtg agccgagatt gtgccactgc 
acaaatgtaa tcacataatt tttttagttg 
tgatttatat ttaatcataa tttattgatt 
cagagaggga atgaacatat attattttgg 
catttccttg gttttgaact tagccatgct 
gtttcaaaat atttgaaatt atctaatggc 
attatattga aaaaatgtat taatattaca 



tttgtgtata ttaaaaataa aataaatgcc 272760 
gcattgtggg aggccaaggc gggagaatcg 272820 
gtaacatgga gagactccat ctctacaaaa 272880 
ttacctataa ttctagccta ctagagagtc 272940 
taagactcca gtgagctgtg actctgccac 273000 
cctgtcttac tccaaggaag aaaatgaaaa 27 3060 
ctcttacctg cttcatttta ttagtataat 273120 
tgtttatttg ccacttactc aaaattttag 273180 
aatgctaaat ccaaaaatat tgtcgtgctt 273240 
ttcttctgta acccattata aggagttttc 273300 
taaataattc agtggtaaat acaagttata 273360 
ccttttttca taaaacagat ttcaattctc 273420 
gcttggtaga aaacagtcta agtgagacat 2734 80 
gtactaatgc tggatcacga atataatttt 273540 
gtttagactg ggaaactttt ccagtgatcc 273600 
tgacccagct atttgccaga ggagagggaa 273660 
ctatttatta tagttatatt ttgtttgttt 273720 
aatcagaaaa ttgtttatta atgtttgttt 273780 
atgagtgatt tttgaaatca acccaattca 27 3840 
tacattgttt cttatttcgt ttagaactgc 273900 
ctataacaga tttctgaaat tgagtaaatt 273960 
tataataaac aaaaaaatat attaagtaaa 274020 
tttgcatttg cagtacattt atgtagacaa 274080 
cacagcaggt gattccaaga ccatgggaag 274140 
cttggatgct ctccttccag ttatgtcttc 274200 
aactattctg acagcatgtc aagcattagc 274260 
gagaggcaga ccaaatacac attgaataag 274320 
attctcccta gcaaagggct agatttgcta 274380 
gggagaactt tgaaatatat gcccatcctg 2744 40 
tttgatagta taagtaatat tctcataaaa 274500 
aaccataatg taaaaagcac atgttaagtc 274560 
tataaagaca t'tgattttca ' ttctatcctg 274620 
cttgagaatt cctatgagaa taaaacacag 274 680 
ggaaagcaag ataccctgga tcttggataa 274740 
atatccatga ggatatcatt ttccttacca 27 4800 
tttttaacct cagagcttct agggtagact 27 4 860 
tactatgttt ccataaatgc taacatttct 274 920 
gatgtatcag ctaagggaga ggagtcagaa 274 980 
tttcttatat tcgtaacaat ccgataaatg 275040 
actcagcagg tgattccaaa tcaaattcac 275100 
atcccagtat ctaaccttac caactggatt 275160 
atcattaaat tgtgaaatca aatatagctc 275220 
aaaaagtgtt agtgtgcgtt gtaatagaaa 275280 
ttccttatat ttccaaggtc tctgtgtcag 275340 
gtcttagtta ataaatagtt cgttattata 275400 
ttcgtcctct tcattcatgg ctgtctttaa 2754 60 
ataaagtaga aaatacagcg tattttctgg 275520 
tgcattctgt gaaatttagg tttaatcatt 275580 
gatgattaaa gtattcaaca aacataattt 275640 
tgttgttgtg aaagcctttc ttacccttgc 275700 
tggctcactc ctgtaatccc agaactttgg 27 57 60 
gtcaggagtt tgagaccagc gtggccaata 275820 
aaaaaattag ccaggtgtag tggtgtgtgc 275880 
catgagaatc acttgaaccc aggaggcaga 275940 
actccagcct gggcaacagg gaaaataaga 276000 
ccatagtggg tttattacac aagcatgtgg 276060 
atattaaaaa atagagacac aatggtctca 276120 
ctaactatat tttatgagta actgcattta 276180 
atgaacaact gaataatctt tattgttata 27 6240 
tcttagatta tacaataaaa attataaaac 276300 
aaaccttaca ataaaacata ataatacagt 276360 
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gttgttttgg catgtgtctc ttttatataa 
tctttttttg agattttact aaattacaat 
ccaacaagtg gccgggcgct ctgtctcacg 
tgcgtggatc acgaggtaaa gaagtgaaga 
tctactaaaa atacaaaaaa ttagccgggc 
tcgggaggct gaggcaggag, aatcacttga 
gatcgcacca ctcactgcac tccagcctgg 
aaaaaaaaaa aaattaccaa caagttgata 
tttacccctt tggtattaca gctgtttatc 
gggttaaagt taattcgtat ctataattta 
aaaacttttt ' ataatacttt aatgaaaggc 
agtatctcat caataatttg gcagatatct 
tataaattat ataaaagtaa cagtattcat 
tttacatcta gcttactgtt ttagtaataa 
acatccttaa aaaatctgag atttcttgat 
tattaaactt aagtaaccag gatgtaaact 
aatgacatct ctctgagttt ttatgtccaa 
attactgcag ggtagatggt aatcctagac 
tctgaccatt ctctgtaaca cagtttcagg 
atgaacagtt agacataatg tgatctaaag 
aaaattattt gctgaaaact ttaaagtcat 
tatatttatt atttagtcat ctgcctttct 
tcttttcctt catttgggga ccatgagaat 
ttttgttttg ttttgttttt ttttttttgn 
ttaatctaag aaattccagg agcgttactc 
actgcagatg atctggcttt ggaacattac 
agtcagaatg tcagtgctgt tgctaaaatt 
tttttttccc cagctgaaaa tacaaggcaa 
tgcttttatt ctcatattag ggttcaatat 
cactcaaaat aatcaatata ctgagagctg 
tttatgtaag ccttgttgcc tatttggaat 
gtgatattcc ttccagttgt aatcatccac 
tttaccccca acttttacta cgcatcaact 
ttaacaatga cattaaaatt tcaaaatacg 
aatcaagcaa gttcccaaag gtcacagagc 
gagtcagatc agtagccaag atttgatttg 
aaactgatgg ctacttaact taattttgtg 
ctcatacaca cacacacgca cacgcacaca 
agatactctt tcaggtgaca attggctagc 
tctttttaag cagagtcttt ctctgaaacc 
tcactgcaac ctccaccccc ggggttcaag 
tgggaataca ggcatgtgcc accatacgtg 
ccatcttggc caggctggtt tcaaactcct 
ccgaagtact gggattaaag cctt^gagcca 
agaacagtta agttttaaaa gcaaatgaga 
gatgaaattg tgaaattata gctaactacc 
tggctttgca tttgttatat ttaatacttg 
ttattacaga atctagaagc cctaagcaaa 
atccccattt gccctatctg tattttcaaa 
aatatttgta ttttaatatt tacctagtca 
ttaatgtcaa tgtcattcaa acacaaagta 
acagagagtc atggaaaata taactcataa 
tatttttcaa aagggaaata tatatacatg 
aggtactaaa ataaatataa tgtaacttac 
aaaattctat atttttgcta tactaattca 
cttatttctt tatggagctg tcagagctag 
agcctttata tgatattata cttaaatcat 
aatccaaayg tattctttga aacatttgta 
aaaggtaggc acctggcaca atccatctct 
tctatccaag tttcattgac ctttagtcta 
ctacaactaa aatttaaccc catactgata 



aaatcagaag gagaaaatcc agttatttgt 276420 
gccctcaata atatgaagac atacaattta 276480 
cctgtaatcc cagaactttg ggagccgagg 276540 
ccatcctggc caacatggtg aaaccttgtc 276600 
gtggtggcgg gcgcctgtag tcccagctac 276660 
acctgggagg cggaagttgc agtgagccga 276720 
gagacagagc gagactccgt ttcaaaaaaa 276780 
atttcctaca gaattaacgt caccctgaag 276840 
atgtacataa ctcccttttt attttgagag 276900 
tagagtaaaa ttatatttgt taacaaaata 276960 
ttattcacca aagaaaataa taaaaattac 277020 
caactcttat tttttctcgt attttaagac 277080 
tgctaaacag taaaacaatt tttttctgtg 277140 
aaatgttctc aattttcatg cattgcagtc 277200 
tttctgtaaa tttggcattt tagtctgaat 277260 
tttatcgctt ttttttcatt taaaattaag 277320 
aattctggat acttctaaaa ggatgtgagc 277380 
tttggaatac aaattaaatg agctgttcta 277 44 0 
gacttgaacg acctgatttg aaaaacaagt 277500 
gtagagtatt tatgtataat cttaagttac 277560 
atcgtatctt aatatatata- accttattta 277620 
ttccaaaaat. gctgaaaaat aagtcagtta 27768 0 
tttctctttg agaaatatgt gttttttttt 27774 0 
ggcagctctg tgagctcaaa tttcttaggt 277800 
tcaaccatga aagctttttg ataggagtga 277860 
cagtaacttt caaagaattt tcttcatgaa 277920 
tctggtttaa acatgtttct ttttcttttt 277980 
acactgccgt tcaaaaacat taaaaatcct 278040 
cagtggatgg aagtaaaata gtgtacacat 278100 
gtagccttac tttgagttct cacttctgct 278160 
gtttttctac ctgatacagg aaatgagaag 27 8220 
gtttctatgg tgtaaagtca ctgtattcct 27 8280 
gcatgccagt tactggctga gttcttcaca 27 8340 
catttgagat agggataaaa atggcagaga 27 8400 
ccaataagaa acacaaacag aataacatag 27 84 60 
tgtgatcacc ataatataga tcttaaacca 278520 
aaaagaatga acacagagag aaagatgggc 27 8580 
cacagcatat ttaactgcat tctactaaac 278640 
ctttttaaaa ttacttaatt aattattatt 278700 
caggctggag tgcggtgg.cg ggatccggac 27 8760 
cgattctcat acctcagcct cccaggtaac 27 8820 
gctaattttt tcagtaaaga agaggtttca 27 8880 
ggcctcaagt aatccgcttg ccttggcctc 27 8940 
ccactccagg actggctggc ctttcttgat 27 9000 
aatactctct ttttaaaact ttccctttgg 27 90 60 
attgactaga caaaactcca actgtgtttt 27 9120 
actcaagaaa accttgtaaa gtgtcttttg 27 9180 
tatatcactc tttttctata tattaggaga 279240 
ctttggggct tcaaatgttt gaagtaaatt 279300 
attttctaag aaattcaact tactaaatgc 27 9360 
tatatactac acttaattaa ggattcattt 279420 
tttataattt acaagtctca acattttgca 27 94 80 
tatacataca tacacacaca tttagatatg 279540 
taataaaatc aatgtattgc atgaattgtc 27 9600 
actcttttgt gccacattat gttagagtca 279660 
tttactcagt aaaatactct gtatgtttac 279720 
atgcaatata attcatatta tgtgcaatgt 279780 
atgtgtattc agtaaaaata tactgaaata 2798 40 
tagcccttct cacgttgtat tgaaatgatt 27 9900 
agctcctatg gggaatttca taccatgctt 27 9960 
aataattggt ggtcaataaa tatttgtttc 280020 
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agctggctga agaagtaagc ctttccaatt 
tccaaataat tagaatcaga ggaagagcga 
atatcaacct tgtggataaa tagagggtta 
tgcttagctc tgcgaatctg ctaattctca 
gtgtgtgtca tgtgaaacac ttcgtgcaag 
caataggtat cagatatcac tgcacagtta 
acttatactt tgaaatacgt" "cactatatga 
accgtatgac ggttgtaaaa atgaagttta 
tatatatatt tgttttctga tatttattat 
ttatataaca aagtacacca ttgaacttca 
gacttttagt aatataacag cattttagga 
taacaaattc tgtcctgttt taattattta 
tctgaaaaaa aaaaaagaaa aagaaaaaga 
gatggttcat gcctgcaatt ctaataccct 
ctgaagacca ggagttggac ctgggcgact 
aataaatttt aatttgaagc aatggtctga 
gataatttta tgcctgattt tgatttttct 
aaatttcaga agtggcttca cacttaattt 
aatgatctac atggtaatta ttttatgtgg 
ttttactccc caaataaaat atggatatat 
gaatcatttg atatgtttat gctgaaatag 
atccaatttt attctgcatt acgaagtgtt 
tctagcctct aacatcctgc aaactggcat 
agtcacaaaa atattcatta aaattaaaaa 
taaactgatc attgtactct actttccttt 
caagttatat tttataagtt cataaatgag 
taattgttaa gttagtattt atcacagggg 
gttgcaagtc tcaaatccta cttagatata 
cttccttcac tacagagtat atatgctgat 
attcctgcca tatgattgca ttttacagta 
tgaatctatt cacagttttt atatatactc 
atgtgatatt ctcaaattgt c.attgattat 
tcttagtaac agtttctaca tcctggaatt 
gtgatggtta atagtgtcaa cttgattgaa 
gtgagggtgt tgccaaagga gattaacatt 
cccttaatct gggtgggcac catctaatca 
tcgaaaaaaa tgtgaaaagg ctagactggc 
gctggatgct tcctgcactc gaacatcgga 
tggcttcctt gctcctcagc ttgcagacaa 
taatattctc taataaactc tcctattatc 
tctatctata ttctattact tctgtccctc 
ccacagtatt atttacccac atatagtaac 
tatgatgcag ttgttatttt ccctcatata 
gccatttgct tgagagtttt tagctaataa 
taaaatgtta tttttgttaa tattgaatct 
agactgggat atatgtgaaa tgatttaccc 
attacgttct tctcaggttg ttacttcata 
tgacaagaat atttgggctt tggctgaagt 
attttgatag taaatattgt gtccaagggt 
ttacccctct gtgattatag aagacttata 
atgattttcc atataatgac cattagcttc 
aacacacatt tttattgact acaaaccata 
atagaggcat atttataaat atcatttgca 
tttattttaa tttttttatt tcaatagctt 
atgaattgta tagtagtgaa gtctgagatt 
ttgtatccaa tatgtcattt tttatccctc 
tccattatat tactctacat tttgcatacc 
agtatttaat tttccattcc tgagttacct 
aagttgctgc aaaagacatt atgtttatgg 
acattttctt ttcttttctt tttttttttt 
aggctggagt gcactggcac aatctcagct 



aataataaaa attaaaattt tatgggattt 280080 
gagagagaga gagagactga gatctcaggc 280140 
tagaaatgaa ttgacaaacg tgctgtaaat 280200 
ggagagtaga aaagacttga catttatcaa 280260 
gtaattttca ctctttattc cttttagtac 280320 
gctgcaccta attatccagc taatttttct 280380 
caatggtaaa ctgtagcatt ttcaaatttt 280440 
ctacttttaa acattccatt ttaatttata 280500 
cgcatatgaa tgattttata tgtgattaat 280560 
tgaatatgaa atcataatct ggtatacata 280620 
ggaattatat tatgagctga gtggcaagga 280680 
agtcacagtt ttactagtag tagaaacaat 280740 
aaaagaaaaa caaaagatct ggtcagatgt 280800 
gggaagctaa ggttggaggt tgtagaattg 2808 60 
tagtgagacc ccgtctctac aaaaaataaa 280920 
gaaagtgaga aattgtcgac ttgtaccctg 280980 
aacatttctt actttaaaaa ttatccttga 281040 
tcatgatcat tttcaatgtt tcttccaaaa 281100 
cataattctc tagttgagga gacacgatgc 281160 
agctctttgt catatgagca catgttgcaa 281220 
aaccttaatg actcatttat att'tccttac 281280 
taaataatat ggtataatat attgtttgag 281340 
cagaataatt ' tacttttcct aagatttaat 281400 
cattacatct ggaaatacaa ctttgctgtt 2814 60 
tttgtaatcc gaatggtaag gtataattta 281520 
gagtaaattt tgttttataa aaacaaacaa 281580 
gaaaggcctt gattcaacaa tccctgccaa 28164 0 
tattacttgt ttgcgtactt ttaaatgtgc 281700 
ggctttattt tttagcactt acattcatag 2817 60 
taatacgaac atgttcataa agatttcatt 281820 
catgaaaatg ttatactttt aatatcaatt 281880 
tctggtccat tgttgagaga tgttcctcac 28194 0. 
cataataata acaagagaca tttagatagt 282000 
gtatgcaaag tattgttcct gggtgtgtct 2820 60 
tgcatcagtg ggctgggaaa ggcagaccca 282120 
gctgccagca cagccaggac attaagcagg 282180 
ctagcctccc agcttacatc tttctcccgg 282240 
ctccaagttc ttcagctttg ggacttggac 282300 
tctaaagcgg gaccctgtga tcatgtgagt 282360 
tatctatcta tctatctatc tatctatcta 282420 
tagagaaccc tgactaaaac acatagtaac 282480 
cattagctta tttaagccat accacaaacc 28254 0 
aacgagaaat ctgaggatga cagaagttaa 282600 
gtgaccagga cagaaattaa cttatctcta 282660 
cagctatagt aaacaggata ttttatgtac 282720 
ttatttatat tccaggaaag aagaaaaatg 282780 
aacggtataa ctttttattt tgggatatta 282840 
ggaaggactt gggactacat gtcaaatttt 282900 
gcccatcctt taaagtatgt aaattcaggg 282960 
tcctgtactt gataatggga aattaactaa 283020 
tcaaaggaac taaaaaagtg ttaatgagaa 283080 
atttttgatg gtctcataat caatttaaat 283140 
tttgttgttg gatttttatt tgatatatta 283200 
ttgggataca ggtgtttgtt ggttacatgc 283260 
ttaacacacc agtcactcaa atactgtaca 283320 
accaaacctc cctcttctga gtctccaaag 283380 
cagagttccc acttacaagt gacaacatac 283440 
cacttagaat aatggcctcc agctccgtcc 283500 
ctgattagta ttccatggtg tatctatacc 283560 
tttttgagac agagtctcac tctgtcaccc 283620 
cactgcaacc tctgcccccc gaggtcaaga 283680 
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gattctcctg tctcagcctc ccaagtagct 
ctaatttttg tatttttagt agagacaggg 
ctcctaacct cgtgatccac ctccctccct 
taagccaccg cgcccagcct ataccacatt 
ttagaacggt tccatatttg caattgtgaa 
gtctttttaa tataatgact Jtctttttatt 
attgaatagt agatcccctt ttatttcttg 
ttgtactaat gtacattccc accagcaatg 
gaaacggcta ttgtttgttg actttttaat 
ctcattgttg tcttaatttg cattttcctg 
ttcttttttt gagaaatgtc tgttcatgtc 
tgctgatttg tttgagttat ttgtagattc 
ttgcaaatat tttctcttat tctggagatc 
tgtgcagcag ttttttagtt taatgtgggc 
tgcttttgag gtcttattca ttaattcttt 
taggttaact tctagaaatt tatagtttca 
taatggatgt atgtataagt tgagagatcc 
ttttcccagc accatttatt aaacagtggg 
ttgtcaaaat taagtcagtt gtaagtattt 
ctttggtgta tgtgtctact gtatactggt 
agtatagttt caagtccagt aacgtgatgg 
ctttttttcg ggctcttatt ttggttccat 
aaaatgatat tggtattttg atagcaattg 
tggtcatttt catgatattc attcttccaa 
ttgtttcatc tacaattttg ttcagcaata 
cctccttggt taactatatt accaggcatt 
gttgtgaaag gggttgagtt cttaatttga 
aatgctaatg atttgtgtac aatgattttg 
aaatctagga gtctttcttt atgatgtaaa 
tctaagccta cccttacatc ccagttacgt 
cttggaaatt ccctactctg acccagatag 
ttttattttc cagatggaat aaatgaagac 
ccccaaaccc ttctgcagta gtcttttcac 
•aacaaacaca ctctttattt gttcacctaa 
caatatttta ttttaaagca tactggaaag 
aagggactcc ccccacctcc accaacacac 
aggacgcttg gtcagtaaag gaatttctca 
gatttgcaca cgtgaagcca ggcttcacat 
actgtcaaaa tcaatgaaaa cttcacaata 
tcagttgatt gatataatat tagataagtg 
caaatggctc agtgaattgt ctaacattcc 
ctcagctatc tacaatttgg ttggcagggg 
tatgttctaa tgctcagcaa actgtgaaat 
taaaaaagca aaactttttg gtatgttaat 
attaacaata aactaaaaca attctacctg 
ggatgatggg tagatcacaa aatcaaagaa 
tgtcaaacag catccctgta aggaacttag 
aataaataag ctgcacgtgt gttttgcttc 
tgggaaagag actcagggca aaaaaagagt 
cacaatagaa aagacatttc ttttcaaaga 
gatactggtc atcaaagcaa gaccttatac 
ctccaaaaag ccattt.tctc attcccctaa 
aacaagcatg ggccatgtat ttaagaaaac 
ccttaatttc agggaacata aattttgcca 
gattctctta ggatcctgat gccttgtgtc 
aaaaagtata ttatatttta ttttaagttc 
ttacataggt aagcatttgc. catggtggtt 
ttaagcccag catgcattag ctatttatcc 
agaccccagt gtgttgtttc cctccctgtg 
tataagtgac aacatgtagt atttggtttt 
ggcttccagt tccatccatg tccctgcaaa 



gggactacag gcacctgcca cctcacctgg 283740 

tttcaccgtc ttgcccaggc tggtcttgaa 283800 

cggcctccca aagtgttgag attacaggca 283860 

ttctttatct actcactggt ggatggggac 283920 

ttgtgctgca ataaacatac acctgtagat 283980 

tgggtagaca cccagtagtg ggattgatgg 28404 0 

actaaatctc tacactgttt cccatcgaga 284100 

tttaagcatt gccttttcac catattcaca 284160 

gattgcattt ctgcaggagt gagatggtat 284220 

ataactcata tgtttgttag ccatttgtat 284280 

atttgcacac ttttgatggt ttttttttct 284340 

tagatattaa tattttgtct gatgcacagt 284400 

gtctatttgt tctgataact atttctttgc 284460 

caattaattt attttgttta cgtttgcatt 284520 

gcctaggcct atgtccagaa gagttttttc 284580 

ggtcttagat ttcagtcttt gatccatctt 284 640 

agtttctttc ttctacatgt ggctagccag 284700 

tactttcccc aatttatgat tttgtacgtt 284760 

ggatttattc ctgggttctc tatactgttc 284 820 

accacgttgt tttggaaact atagccttgt 284880 

ctccagattt gttcttattg catagaattg 284940 

atgaatttta cgtttttaaa aagtattgta 285000 

cattcaattt gtagattgtt ttggtcagta 285060 

cccatgagca tgggatgtga ttccatttgt 285120 

ttttgtagtt ctccttgtat agatctttca 285180 

ttattttttt atttttgtat gtttgcagct 285240 

ttctcagctt ggttgttgtt ggtgcatagc 285300 

taaattgaga attcactaaa tttggttatc 285360 

agagaagcaa aggccaatgg agtgtcaccc 285420 

tgctaactca gaaaaatcag ctgaagggca 285480 

tcttttgatt aaaatcatat ggttattagt 28554 0 

caaaagaatg tttatgtgga aactgtgttt 285600 

tgctgaaatc gttgatatac taccattata 285660 

cctgcatgcc tttattcctt agaagatttt 28 5720 

aacagaagca ttaaagaaaa aaaataaata 285780 

acaggtatcc tctttctcaa ttcatcagtg 285840 

tgatccctcc tcctttctct acagggcata 285900 

gactgtattc cataatagca caccaaaaac 285960 

taaagaatat agctagaatc atggtaatga 286020 

caggatggta acttttataa actaactttt 286080 

atggcaaatt tgtatgaaga ctacctggaa 28 614 0 

aactaggttt gtgagtttgt ttcccccctc 286200 

tttccatgaa atataatagt tacgtaaata 286260 

tattgcacca acttgtgttg tttgactact 286320 

cttgattgaa gcaaaaatgt caatgtgtaa 286380 

acagctacat agtctgactc agacaggaca 28 644 0 

tctacagact cacataggcc ctgcctcagg 286500 

cttgtattct ttgatgtcag ggtcagatac 286560 

atattgattg acagcttcct caagacagcc 286620 

aactgagtat aattgtcaaa aaagaaaata 286680 

tttgtcaaag tttactttaa atgtcacctc 286740 

atcattctca gtatatctca gctttcatta 286800 

agaatttacg ggaggacaag acagaccaga 286860 

aagatgatat tgccatacta ct at ttctgt 28 6920 

ttagttatgt tttttcttaa tcagaagttt 286980 

tgtgatacat gtgcaggaca tgaaggtttg 287040 

tgccacactt atcaatccat cacctaggta 2871O0 

tgatgctctc cctccccaca, accccgaaat 287160 

tccatatgtt tttattgttc tgctcccact 287220 

ctgtttttgt gttagtttgc tgaggataat 287280 

ggacatgata tcattccttt ttatgggtgc 287340 
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atagtattcc atgatgtata tgtaccacat 
tctgagttga ttccatgtat ttgctatggt 
tatcttaaac aaatttgtaa gaaaaaaaca 
atgaacagat acttctcaga agaagacatt 
aatatcactg gttattagag aaaagcaaat 
cagaatggtg attattaaaa agtcaagaaa 
aggaacactt ttacactgtt ggtggcaata 
ggggcaattc ctcaaagaca tagaaccaga 
gggtatatgt ccaaaggaat ataaatcatt 
tgatgtgttt tattagttaa aacaataatt 
acttaccaca tggtgtatca tgcaatgcac 
tgcacagtct cctcacttgc actctttgct 
acaggaaaat ctgaattctc acaattattt 
agcccagttt tattacctgt atgacactta 
tggatttaaa ctcattttcc actttggtat 
ttattgcaga cccactccac cttgtgttgc 
gtgagttcct agagtcagag tatccaggga 
atgaacataa tgaaataaca tttatgttag 
cccctttatt aaagttgttg tttcatatta 
acagagagtt atatgtggta tttgtattac 
gaataaactg ttaggacaaa ataaagaggt 
attctactta gctggacaga aaagctttct 
ggcataatcc ttgcagacaa gaagaaaagt 
cgaacaaata tacaaaggat gaggaaggtg 
ataattaaat taggttattt agaaggtaag 
taatggaatt taaaatcatt ttttaggtaa 
gaattaagta attctataca cattttagaa 
tcaagtgtga aaattagtaa gatgaagagt 
gcacaagaga ttattacata tatatgtgtg 
tatttttaat ccaaagcata tggtaactga 
caggttgaat ccaggatttc atgggagata 
gattaagaat gtgagaaaag aagcagttgt 
ggagtctact ggttactaga aggtttacaa 
tgacattata gtgtactggt ataactcttg 
aggaagggaa ggggacagaa gataaaaatg 
ggcaagttaa aagtagggca ctagagcaga 
agggaaaggt aagaaggagg cgagaagaga 
acaggagaga ttgtctgtaa aagggagaag 
aattttcatc atggaagcct aaggtaaaat 
atctaaaatg gccaagatca atcatgatga 
agataaaact ctgtaatggt gacagcagcc 
agggatgtgg ggaatcctgg ggtgttgaac 
acattaccgc acttcatgtg ctttgctata 
aaaatgtaat acgaaattaa tataacacct 
ggattacctg ggtaccatca gctgaaggat 
ttagctgctg accgctgtac tactctgctg 
tggcatgtta aatggactta gtctctgttg 
cctattctac caactgataa attgaattgt 
aagctttggt tagaccataa ttttttgtca 
actgtttaga tattacatat acttttgctg 
ttttaaaaaa atgtactatc agcagtggta 
aacagtatct tgagcttaac aaagcttcag 
aataaaagga aatacttaaa gcaaaggtat 
ggatggaaaa gtctattttc ataaaatcta 
cttcttttcc tgtttaggtt aagatctttt 
atgattaatc ttattttacc tttacagaag 
catctcctta acatatgaat taattgtggg 
acgtagatcc aggtggctca aatagatatt 
atgtaagcag ttcatcttcc gagcagattt 
gctcctggaa aaagtgttct cccgtgagct 
atctgggttg cccacactgt gtagagtttt 



tttctttatc cagtctatca ttgatgggta 287400 
gaatagtgct gcaatgaaca tatgtgtgca 287460 
aacaacccca tcaaaaagta ggcaaaggac 287520 
catgcagcca acaaacaaga aaaaaatctc 287580 
ctaaaccaca agataccact tcacaccagt 287 64 0 
caacagatgc tggcgaggct gtggagaaat 287700 
tcaattagtt caaccattgt ggaaggtagt 287760 
aataccattt tactcagcaa tcacattact 287820 
ctgttatcag gaaattttaa agcttctttg 287880 
tattagttaa aatgttgtgt ctttcctaac 287 94 0 
actaatgaac gcttcaaaat tagccttaaa 288000 
tatcttgtgt ctgatggctc ttctgcatga 288060 
ctaaagcttt tattaccttc ccttaaccca 288120 
gacagtttac attgtaatcc caccctctaa 288180 
ccccatattt agtcattatt tgtacttctg 2 8824 0 
agtgtgtgta tctgtgatcc cctgtagagt 2 88 300 
tagctctctt taaacatcta atgggattct 288360 
ctaaatgtat aaagagaaaa tatatacttg 288420 
ttatatgcca acaccatact agtcctattg 288480 
ctttaaagat attagagcat aaaataattg 288540 
acaatacaga cactgaaatg ggattggcca 288 600 
agaatagatg aatctcaaag gataaacgga 288660 
gctttccagg tgaaagaagc agtggtacaa 288720 
gagcatccaa tttaaggatc aaaataacaa 288780 
atcatagata aacacacaca tgcatgtgtg 288840 
gaaagattcg ttggaatgtt ttagagggaa 288 900 
agctcattct gatatcctat aaagaatgtg 288960 
cagtaatagt gaatctaaaa atgtgaatga 289020 
tgtgtgtgtg tgtgtgtata catatatata 289080 
taacaggcgg gagtattaga aagggtcatc 289140 
gaaggaatga acagtgtttc cagacactga 289200 
aagtgtgttg aacttttgac gtctatacaa 289260 
agagacctag gtggatatta ctagtctttg 289320 
gtgctatttg gggttgctaa aaatgaaata 289380 
gagggaggat aaaggggaga acatgagagt 289440 
agacagaagg aaagcgaggc atacaaggta 289500 
acctgtgtga tttccaacat ctaagtgtgg 289560 
acatggtcac gaggaaaggg ggtgctagca 289620 
tttcagagaa ggatgtttaa tgtaagtgat 289680 
taagtgaaca ttcagataaa gtttcagaac 289740 
atcggagtgg ttacatggta tggcctgggg 289800 
atggcctatg tctcaaagtg gtgaccaatc 289860 
agcaggtcat ccctcgattc aaaatgtgtt 28992Q 
aagagggtaa aaaatattag tggctttctc 289980 
tcaaaccaaa ctctgttaga ctcagatgcc 2 90040 
tcaaatccaa tccttgccaa gcctctccat 2 90100 
tcccttttag tcttattttg aaggcaccta 290160 
tattactgtt ttacagtaag atttggtcag 290220 
gatctgttgt tctgaactct ccattaagga 290280 
aaattcagag gtactcataa atttgacgac 2 90340 
aatgataact aaaatggtat aatttatctt 2 90400 
ataaaaacaa gattgcattg aatatcatgc 2 90460 
ttatatgttc ttatgtgtat gttcttacag 2 90520 
attttttaat ttacaatata gagccctttg 2 90580 
cttctcttga aagattatgt atattttaat 290640 
tactgtgaat tagcagggca gatattatga 2 90700 
ctgccaaata gtacatgatc ttttactgga 2 90760 
taagggcaga gcacctaaac tgaccagcca 2 90820 
atttttcact gtaaccaaat gtatttacat 290880 
ttgttagtgt gttttcacca aggaactgct 2 9094Q 
tgtcctgaaa agctaaagtc catatattga 291000 
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agaaaaccta ccacataaag atgtttccta 
tgtagtggaa ctaactaatg gatgcttcat 
catagttgaa tattgtggtg tcagtttaag 
cccaaatttc agtggtttga taaaaattat 
cttgtcaaga tttgtgtaat atcatagaat 
ttgactctgt gacataattt tagcagatac 
taatcttgac ttacgagttt "agaatagtgt 
aatataaaca ctctatgtac tattttagta 
gaaacgtgat ctaatatcac tttcaattct 
taaactattt ttaaaagttt atcaactgga 
atggctttaa aaggatggct attttattat 
tttttgtact aaccataggc caattaatga 
aatcttatat tcatggcaat atactacccc 
gagtttctgt tttatctctg caaaatataa 
caccccataa atattaggtg gataaataaa 
aagttttaaa tatagagatt tcattccatt 
caaaaatcat gtgcacaatt cagactcata 
aaaagaaatc tgcaatcttg caaaaatgag 
ttgtagctgt tattcttgcc tttcttctta 
attccatacg taacaagaaa aagagcagaa 
gatattgtgg tattgtggtt gaaaacatta 
tctagctctc catctaactt ccatctctag 
tctggacttt aatttcttta tctggaccag 
atttttccag attaaaatct aagatgtggt 
ttgtcagaca aacaaggtga aaatgtgttt 
acagtgttcc taattcaaca ctacattttt 
agttacttca gtaggatata tctgttgttt 
ttatttcata acataaataa tgttaagaga 
aagtttttac aaattctatg aattccctac 
tgtttataga agccgagaag cagaacccct 
ttcttctaat cttctcaaac caggaaggtg 
agatactatg taaagttaca. tttttccttt 
ttattttaaa taacaagata tgtagctttg 
gccaacagga cccaaccaga gcaagccaaa 
agaacagaat ctgttagaga aaataagaga 
aataaaataa aatagaataa aaggagcttc 
taccaaggga acactaccaa gtaaatatta 
atccacctgt aaaataatga atcagaacac 
aacccaaaat agatcaaaga tctaaattta 
aacataggtg aaaatatgca aaagttttta 
aaagcacaag caaccaaagg aaaatcgata 
atgcattaaa ggatactatc aagaaaatga 
tgcaaacata tgtctgacaa gtattaagta 
agcagtaaaa acacaattca attaaaagat 
gagaagatac acaaatggcc aataagcagt 
tgcaaatcaa acctacaatg aaatagcact 
ataagcaaac agaaacaaaa caactattgg 
cattgctagt aggaatgaaa catggtacag 
ctcaacaagt taaacataga attaccatat 
ctgagagaag tgtgttagtc cattttcaca 
aatttataac ataaaagagg tttaatggac 
actcacaatc acagcagaag gtgaaagaca 
gagagccaag tgaaaggggc tttgccctta 
cactaccatg agaacagtat ggagaaaacc 
tcctttccac atgtgggaag tatggtaacc 
caacaaagcc atatctgaag taagacaaat 
agcagcatat tcattatagg taaaagaaga 
atagaatatt attccatcat aaaaaggaat 
ttgaaaatat tatgctaagt gaaacaagtc 
catacagaag aaatgttcag aacaagtaaa 
gccagtggct ggcgggagat ttccattaac 



gaatacccgt catgatcaaa ggtttgcagc 291060 
tttagtaagg gcctcttttc ttccattgat 291120 
atgagtccat gtgtgacatg gatttaaata 291180 
aagaaatact gcaattatct caaactaaaa 29124 0 
gtggacaaag tgattaaaaa atatattcta 291300 
atctttatat tggccaacct ttctttgaat 291360 
catgtctgga gagaaaaaga cttcatgtga 291420 
ttaattttag aaagatattt ggcacacctt 291480 
aagtctattg catcatcaag cagaattata 29154 0 
ttttatgtta ttagcaatac atgagatgtc 291600 
gtagtatcct tcaaatatta ttttagtagt 291660 
ttcagaaata tatttaattt ctattttata 291720 
tcactgattt ttgcttggct ttattaaact 291780 
taaaaaataa aaatatgctc aaatattaag 291840 
tataaccaaa ctgctcttgc aaaagtgaaa 291900 
ccctttaatt tttcaaaata ttagatgata 291960 
gctattttta gaaactataa tttttgattc 292020 
actcagttta acagtgtatt tccaaagaaa 292080 
gacaagagac ttgtgatggt atctcttaat 292140 
aaagccatgg gaactataaa tataggcagt 292200 
tctggaagtg agaccacaac aggttctcgt 292260 
agatgtgccc ttgacaaaat gattttttcc 292320 
gtaagctcaa agatctcagt ttagtatttg 292380 
atcagataca cttttctatt tttcttttat 292440 
aaaaactagg tttatccatt tttttcctgt 292500 
ctatatactt ttagtagaaa gtgaatcata 292560 
tgctcttgta ctcttccttt actattacta 292620 
tctgggtcta aatgtaggat ttaatttttc 292680 
agtttctttg gccgtcacta tttgtctggt 29274 0, 
atggctgaag actggtcctg atagagattc 292800 
ttcaggtcac tctgtaaatg attcaaagac 292860 
aaaatccagt ctgtaagcaa attttatgcc 292920 
gcaacatgtc atcttatttg gataggctat 292980 
aaagatcttt gaatgacaga cagcttaaga 293040 
aaaggagaaa tgtactgagc agaccaaaaa 293100 
cacacaaggc tgttgggttc attcaccaca 293160 
taaataaaat acttgctgga gaaactgaat 293220 
ctacttcact tcatgccata tacaaaaact 293280 
ataagtataa atataaaact cctagaaaga 293340 
ttagggaatg agttcttata tatgatgcca. 293400 
aattagattt tatcaaaatt aaaaactttt 293460 
aaagcaagta acagaatagg agaaaatatt 293520 
tctagaatgc ttcaagaact cttacaaatc 2 93580 
gagcaaagtt caatattaag ctattcccca 293640 
gaaaagatac tagcattagt cattagagaa 2 93700 
ttgcacctac taggtggcta taattaaaaa 293760 
tgtggatgta gagtaatcgg aatcctcata 293820 
ccactatgaa aaaagaaatg ttttcagttc 293880 
gacccagcaa ttccgcagct aggtaaatgc 293940 
ctgctgataa acacatcccc ggggctgggt 294000 
tcccagttcc acgtggctga ggagggcctc 294 060 
tgtcttacat ggcagccagc aagagaggat 294120 
taaaaccatt gagatctcat cagacttatt 294180 
acccccatga ttcaattatc tcccaccatg 294240 
acaattcaag atgaaatttg ggtggggaca 294300 
tatctcacaa aactatacac caatattcct 294360 
aaaaaggcac atgtccatta tattcataca 294420 
gcggtagtgt tacatactac atgatgaacc 2944 80 
tgacacgaaa gaccacataa tgtatgattc 294540 
ggcacagagg cataaaatag attagtggtt 294 600 
agctaacatg tacagggttt ctttttgggt 294 660 
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tatcagaaaa ttctggattt catatttatt gttgcagagc atagtgaata tactaaaaca 294720 
aacaaacaaa acactaacat atactcttgt aactggtgaa ttttgcttat gtgaaaaata 294780 
tgtcagtaaa aatgctaaag taaaacaacc tgaagagttg actatatata tttgtagata 294840 
tacacaagta tgcatttagc ttttataaac aatgtcttgt taaattatta ctagcaattt 294 900 
tatagtcagt taagattgaa tacatgttta attcctgcct ttattgaata ttcattctgc 294960 
aactacttgt gctgggcact gtcctaggtt caaataattt aacaataatt aagagagaca 295020 
tgcttcttct agaaaagatc* aatgatttga aaacattttt atttcaggac acttttatac 295080 
tcttgaaaat tattgaggac cccaaagaac tattttttat ataggataca tccatcaata 295140 
ttcactttat tagatattaa actgagaaaa aattttaagc atgtgttgaa ataataaact 295200 
attttaatat aaataatgca tttattaaaa tattttctat tagaaacatt ttagagaaga 295260 
gtaaaattat ttcatatttt tgcaaatatt ttgacatctg actgaagaga agattttcag 295320 
atttttacat ctgctccagc atttagtcta ttgcaggctc ttgttttggt taaaatgtat 295380 
gaaggaaatt ttgccttgca cggatatgga aaagagagga gtatttcaat taccttttca 29544 0 
gaaaattgtg aatatatatg tatatatatg tgtgtgtata tatatgtgta cagatagata 295500 
gatagatgat agatagataa ctacatcagc actaaacaag tggtaatttc ttacatgttt 295560 
gttgcaatgt gtaatccgga actgtatcag tgaaattctt ataacctgtt acattaaaat 295620 
tgattggtcg ttctttcact ttgaagagat cttttaatca aaagtgattt tgcagaattt 295680 
tgcattggtc atttggaaaa ggtttgttca tgaagctttg cagattttcc aattgttgat 295740 
acattttatg atacaaataa cacttgttaa taccaccact gttctcaaaa ggagttttaa 295800 
gtgtggaagc cgtaagctgc caagctaatg gtggcacaca cacgttttcc gaaattctgc 295860 
ttttcacttg aaagctagaa ttttataatg aataacaagt actgtcagat gtttccctta 295920 
gggtcacttt gttcatgttt gaaaaagaaa agtctgccaa atgcctactt atgaattacc 295980 
atggtttgtc agtcattctc tcaattagaa gcgaagagct accaaaaagt gggcagcgaa 29604 0 
ctctcactcc aacgaccgca tgagcgcccc ctgccgtcaa ctgtcacgct gccgtgcagc 296100 
agaagtgctg gatttcatct gaggatgtga aaaaggcatg tactcaggat ggaaattcca 296160 
cataatgaac acttttcgct gcttcatcaa ggacgtgctt aactaaaagc agcatcgttt 296220 
ttggttgttt gtttacactg ggtgtgtagc agtgaagact tcagtggcac ctagtgcggt 296280 
agctcctcga ttcttcatct gcactttcgc cattgatgcg ccactagttt tgctcccctt 296340 
atgcaatgtt agcacagtga. aaaagacaaa tacatttcgt tttgttttgt tttttttgag 296400 
atggagtctc tctctggcgc caaggctgga gtgcagtggc gtgatctcgg ctcaccacaa 2964 60 
gctccgcctc ccgggttccc gccattctcc tgcttcagcc tccctcccga gtagtgtagc 296520 
tgggactaca ggcgcccgcc gccacgcccg gctaattttt gtatttttag tagagacagg 296580 
gtttcaccat gttggacagg atggtctgca tctctttacc tcgtgatccg cccaattcgg 296640 
cctctcaaag tgttgggatt acaggcgtaa acgtttctta atattatggc aataaatata 296700 
gttttggctc cagagtcccc ctaaaa'gtgt attgggacac ccagagacct gcctgagcac 2967 60 
cataactgga taactgctgc tctagattct aaaactttct ctgagttgct ttatcaattg 296820 
tccttttctt gttttcttac acatgggaga ttcttcctca ttgtctctga aaatcctcat 296880 
gtttcttatg gttattttat atggcttcat gatgcacaca cacatcctta tatatatttt 296940 
cttccgaatc ttctctaaca tgccctcttc ttgcctatag atcatgacag tattctacgt 297000 
acatattttg aggagttttt attatgttca ctctgggaga gctttcgctc ctcagggtac 297060 
aggagacttg tgaaccccag agtcagtggc agtgcctgca ctccctagct tgctgctcac 297120 
atcggtagct gaaatgcctt agtcttgctt cagttatttc ctttaaatag atgctaggtc 297180 
gaataggtgt tgtggcctct tagtaactga gtgatgggag aggtcatttc tttgcttata 297240 
agaacacact gctttctaag tgagaagagc ccagcttcct gacagagcac cattagaaga 297300 
accctccact gtccttgtta ggttatagtg cttatcaatt tctctccttt acagggccca 297360 
atccaatgag tgacctgttt tgggaagcaa acatgttaaa agaaacccta cttgaaagtt 297420 
aaagactaca attgcatttt agaatataga ctagtgctac caattgtgat tgtcctcctg 297480 
aacgctatgc ttttgtgaca gttaatggaa tttaaaaagg aaaagtaaac gtatctataa 297540 
cctaactttt gtaatggaaa ggattggatt ggaatggaat cctaatacat tatttctctc 297 6Q0 
agaattaagt ccattgttct taacctgtta ttcttcagtg acctcgcagg actgcaagct 297660 
acctaaaact gaatgtaaga ttttgtgagt tttttttttt tttccctgca tgatatatag 297720 
acatatatac actggggaaa gaatatcaaa aggtacattt gttttattca ttcaaatatt 297780 
aaatatccaa atatttctta tatacacagg tataggtatt tagttattgg atggtaaaat 297840 
atatatattt ttatatgtat gttttggtca tttcacttgg taaaatagag catcagtgtg 297 900 
cactatggac agaagcagta ccatagtttt ttcctcttat atcaactttt cttctcattc 297 960 
cctctttttt cttctgatcc atctccttcc cctccctcaa ttccagtttg tttttctctt 298020 
ttaaagttgg ctatggacag caaagacaag tgaaatgtaa tttaaaatac gcacatttaa 298080 
catattctat attataaatc atgctgctat aaagacacct gcacacttat gtttattgcg 29.8140 
gcactattca caatagcaaa gacttggaac caatccaaat gtccaacaat gatagactgg 298200 
attaagaaaa tgtgtcacat atacaccatg gaatactatg cagccataaa aaatgatgag 2982 60 
ttcatgtcct ttgtagggac atggatgaaa ttggaagtca tcattctcag taaactatcg 298320 
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caaggagaaa aaaacaaaca ccgcatgttc tcactcatag gtgggaattg aacaatgaga 298380 
acacatggac acaggaaggg gaacatcaca ctctggggac tgttgtgggg tagggggagg 298440 
ggggagggat agcattaggc gatataccta atgctaaatg acgagttaat gggtgcagca 298500 
caccagcatg gcacatgtat acatatgtaa ctaacctgca cattgtgcac atgtacccta 298560 
aaactttaag tataataata aaaaaaaatt cttcattgac ttacatgcaa tgaataattt 298620 
ataattagcc tatgttacta ttcattttcc ttttattttt ttcttgcaaa attatgttag 298680 
aactgtgact gaaatgtatg-agtttctcaa tagatgctat tgattgagtt attttgaagc 298740 
acaaaaataa attgaattgg ttaatataca ctttgtaggt caaaactggc cctgggaatt 298800 
attttttaga gctattccac attattcaac ttcgtaagct tactgttggt tatttatata 298860 
tttttaattt ttgaattata atagaagcag tttaggactt aaatataaaa atatgtaaaa 298920 
tgtaaataca ctcatttttc attttaaaca catgctttca tgtcataacg gatcccaaaa 298980 
taccaaagat attaggggaa aaaataaact gttgttgtct taaaatacat tcctttgttg 299040 
aattcatcct gctaaccctt tagggtattg acagcctatt atttttcata tgtgccatct 299100 
ttatctagaa aactgagaga acaactaatt atatcctttt gcaaatgagt gcaacagaat 299160 
ggcttttgtt caacacaatt ttt'aagtctg ttgaacagga ctgttgaatg gcttaaaaat 299220 
ttgccatggg gtgtgtgtgt gcacatatga gaatatgtat gttaatattc agccacctta 299280 
gtgtcatgga aagcgtggcc tggggatttg attatacatt gtgctttgtc ataaatttaa 299340 
tggtttcttt gcaacattat tggagggagt atgcctctat gtctttcatt tagcttgtat 299400 
atttttggta tgatagacat ggtaattact gctcttaaag tttgtgcaaa tgccactggg 2994 60 
aaaggggata ggagtaaaga attaaagtga aggtcatgaa ataaacctat taacaaaagc 299520 
agagtgattt cttgaataat cccttcatct ctccccatgt cataatcctt tctcataaat 299580 
aattatacaa gcaagcctag acacacacta gtgttgaaga attgtgctgc aaaatgcatt 299640 
tagctgcatt aaataccaag agaagccttg agttccaaaa cctccttaaa gtggaattct 299700 
cagttgtttc cagagaagag gcaagttata gtacatataa tctattatga acgtatctga 299760 
gtagaacacc atgcaataag gcgctattga attgggaggg aggtatcaga gctaaagagc 299820 
acaaagacgg gccatcgaga tattagagga gatttctctt catatgtaat tctgctttac 299880 
atgtttcctc aatcttattt cctttctccc taccgcctct ttttttttta gccacaaatc 29994 0 
attgcgtttg tacagattac aggaaaacac tcaaatattg ctcgaagaag cataatcttt 300000 
cttctgcagt tggtaataaa tgagatctac ttgtaaaaat tagtgtcatt gtctacaaca 300060 
ctttcagatg actctcatgc caccaggtca atattcctgc tagggatgag ttagttctta 300120 
cacttgggtg atcaccaaga atgtgggact cataagggca atggatgatg caggcgcata 300180 
gtgtcctttg atgttactgg aggtgtaaaa tgaaggtttt ttttcattct aaatcaaagt 30024 0 
ttaatggtat agtaagtatt tattatcttt atcctggaag ggattttatt gcttttgaca 300300 
tttttaaaaa . tgtcaaattt taattttaat ttaattgttt aaaacaattt ttaaactttt 300360 
atatrgtttt acaattcagt ggaattctct cttctttata ttattgtact tgcattaata 300420 
atattttaga tgttttcaat catgagttat tgtcactatg ttttattaga aatatccttc 300480 
tcactatgag ataaagggac gaaatgagtg ctaatgtaat tctacagggg tcagatagca 30054 0 
tttccatggt tacattgatt cactagcagc atgtcctgaa taactcctgc tgtattgcaa 300600 
accccttgcg ggcagggtct agtatatttt taatgtttct gcatatagct ccaatttagc 300660 
tccaattttc ttcctacatt gtagtactag atcaaaaaat acttattggt tgaacctata 300720 
atgtatgcta cttaggcagc tctggcatga ggcatgacat tatattttaa ataaagaaac 30078 0 
acattccagc attccaaaat tctcaaagca taaccctgtg taaaaccata gttaaagttt 30084 0 
caaaatttga aacacgaaaa ctgacttttt aaaatgttaa ccatacaatg actaatgcag 300900 
tgtgctcttt gcacgtgcat aaacaataag aaacattatt aaaataaaat atttatattt 300960 
taaagtaaac atacactatt taaaatactg agcaaacatt tatataagta ggggatttat 301020 
ataaaatctg aatgatgaaa gtacttaaaa agtactaata gtaactaatt tattgaatct 30108 0 
ttgtggcagt cttcatatga ataatttaac caccagaata gttacggaag gtaggtacaa 30114 0 
atgaataacc catttgtata ggagtaaact gatgcctaga aagagaaagt cactaattca 301200 
agctcataca acgtacaagt agaaaagtga ggattttagc acaggtaatc ttgatacaga 301260 
atgatttcca ttatttttaa ataagagtaa tacatgtatg tatatgaatt caaataatac 301320 
agaatgataa aaatggcaga atacggagcc cccttttcca cccctcgagt tccttctccc 301380 
caaccaatat ttatattaat agcttatttt atttatctta aaaatattat attttacact 30144 0 
cattatttct atctatatct ctatatagat atatatcttc atcattataa aataagatat 301500 
acagtgataa acctttctct taatatgaat ggatgaactg atctaccata tattattagt 301560 
agctatattg aattccgatg tatggatggg cctaattatc ttttttatta ataaaatttt 301620 
agtttgttta tatcattaat attctcatat tttgctattt gcaaaatatg ctggaataaa 301680 
caagagttaa ttttatttct atttactgta aattgtgtca ttattactga agtaaaagga 30174 0 
ttgagatgac ataatgccta tcagtgcctt taaaatattt caataacaac aaaaaagaaa 301800 
cctctttcaa gtgtgcaaca tcctgataat taacttatct gggtaatgag tgtatgaaag 301860 
ttcattataa atgatatttc tatgtgtgtg tgtatttaga atttttacaa tagttttgtc 301920 
aaaagttatt ttttaatgaa gcatctgatt tttagggagg gatctaagct ttccacacat 301980 
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gcttctattc tccatagtag tgtaagaatt 
aggaggatta agtgatatga atgaaactgt 
gctgaaaggg ggttctacaa agaagtgtga 
atgtcattcc aataaggact atatttatct 
gagaatgaag agaaatagtc attagtgcaa 
ttggagatat tttctctctc - tctctctctc 
aggctctatt tatctatcca tcatctctgc 
actcatatct ataccgatat ctatatttat 
tggtaacata tggcagattt gtgtatacct 
tctctgtcat catgattaag gggagtaatg 
tggggacagc tatttttctc tgttataaga 
gatgacttct aaactcttga tgggagactt 
gcaatgggaa ctacacaccc aaaggccaag 
gagagcccat caatcacgag aagatggcag 
ggacagctta agattgaaat ataatagaaa 
tattttatta ttattattgt tttgtaatgt 
taataggtta ctagattttg taactaaata 
a-aaagtcaga aatgatcctc atccatttta 
tctcttcctt cttcaacaag gcactacgtt 
gctttgatca ttcagggtta gtcttcattg 
agtcttaata aattcttcct attgggtggt 
taactttaat tactgaacaa tgaagcatgg 
ttaattatac atcatggtcc aagggctcta 
taactatcgg tgtacaaagt cccacagatt 
ccccagtagt tcagtggcta aagtgggctt 
gaaagttttc tggctagaga gttaacagct 
atggctgata aaagtatcaa tgttgtatat 
ctcaaaccac tgttaagctg tcacccatag 
gttttatctc cagacatttt tctgcccaga 
tagtttgttt tagggaggat tgtgttggtt 
tcaggtatac aaacacgtga atgtatttgc 
ggtatagttt caaattgctg aagccaaata 
ttgtcagaat gttggttctc tctcatattt 
aactaatagg tttttatatg ttctaattat 
agtggtaaga gagaaacata tggttagaga 
actgaaaaag tctgctgtga atacagtggt 
gcccatgcaa acagaaaaaa aaatggaatt 
attgacttac acggattaca aatttggtat 
cactctttgc atttttttgc tacatcatct 
agtcaagtta atatttaaag tcacaaagag 
cgtacctgtt gtactatggc atatgtcttt 
aaaaaaggaa ataaaaaatc ccaaaaaaca 
tcagggttga tattcctcaa aagaagacat 
ttcaacatca ctaatcaaaa atcaaaacca 
tggcaattct ccaaaagaca aaagataacg 
attgaacatt gttggtggga atgtgaatta 
ttcctcatga aaacaaaaat agaagtatca 
tatccaaaga aaatgaaatc ggtatgtcaa 
gcattattca caatagccaa ggtatgaaat 
ataaagaaaa tgtggtatgt atacacaatg 
tcatgcgaca acaccaatga acctggaaag 
ggaaaacaaa caccgcatgt ttcactcatg 
tagagtacaa tggtagctgt cagaggctgg 
gtcggtcaga gcatacattt acatttgata 
tagcaaggtg actctaatta atgaaaatat 
gtaatatgcc atcaccacag aaattgtaac 
agatttaaca agtcaacaat gaatatacac 
acatacaatg ttatctgtca attaaaaaaa 
gggtatttat aaaatgcata tttactttgt 
gatttatttt gcttaataga atagaataaa 
agtgatccct ttagaccttg atgattttat 



tcaaatgtgg acacgttaaa gtatttctag 302040 
tggggacctt acattttatg tgtaatgaga 302100 
taatagagga cagatttttg gaatgaggta 302160' 
agggatattt tgaaagaaac tggtgatgag 302220 
acatgtcaag aaagaaaaca tattacttct 302280 
cccctgcctt cccctgtccc tctctccata 30234 0 
atatgtcttt atccatactt atatctccat 302400 
atctgtaagc tgatcttagc ctactcatca 3024 60 
gttatcagtt cccaaatctt tacacacact 302520 
tgctcttacg tttttgagaa ttgggcccaa 302580 
ttccatagat tgtggcaaca atggcaccat 302640 
gcttctgtga cttaatggat tattagccct 302700 
aggaacatgc aaaggcaaac agagagacaa 3027 60 
gaagttgtgc cgttcccatt gcagttattc 302820 
gttatatgac gtactctaca gacaagttta 302880 
agaaaacatc tacccctaaa aatctgtgga 302940 
agataataga caaaatataa tagatataga 303000 
gaaacccaat caatttcaat ttaatccaaa 303060 
atcctcatcc ttccagcttt ccttcccaca 303120 
cccatacttg ttcctatttg ttttaaataa 303180 
aataagatat attcaataag gtgaaatgtc 303240 
tagaggttta tactttaata atataattta 303300 
ctttaattaa tattttceac tagcaaaaaa 303360 
agacaccaga accttaggca ggtagtaagc 3034 20 
taagtaccta aggtttatct ttattcatgt 303480 
gagtgcattt ttagtaaaat actgcaggaa 30354 0 
ctgcttaaag gagatgcata tctgttcttt 303600 
acttattgta gaaacaatta tctcacagta 303660 
caaaagaatc tatggaagta gtcatttctc 303720 
ctcctatgag aaatcacttg cacattatgt 303780 
tatacagagt ccatttctta cacacagcaa 30384 0 
aaccaaatgg gaa'gaaattg atggtaaact 303900 
atgaatctgt caaaggatag actttttgta 303960 
gcaaggaaat ctatcctgtc ttagcaatca 304 020 
tcagggaaga atatgttaag tgaataggac 304080 
gggcagccat tgtgagatta agtagtaaat 304140 
gtaatataaa gtcacaaata taatttgttg 304200 
attgagcttt ccgggcttag tgaataacat 304260 
ggccacaaaa ttttaaagac tatttatgaa 304320 
tcctagaaat tttgaacatc acactgactt 304 380 
agtcaactaa ttatttttct cgttaacaaa 304440 
aataacccaa taaaaataat gagtaaaata 304500 
acaaatgaag agcaagtata tgaaaaaatg 304560 
cagtgagata acagctcacc ccagttagaa 304 620 
tgtttgcgag gacgtggaga aaagagaatc 304 680 
gtatagcctt tacgaaaaac agtatggaga 304740 
tttgatccaa cagtcctact actgggtaga 304800 
agaagtgtct gcgcttccat gtttatgaca 304 8 60 
caacctaagt ggacatcaat ggaaaaatgg 304 920 
caatactatt caaccataaa aagaatgaaa 304 980 
cattatgtta agtaaaagaa accaaacata 305040 
tggaattcaa atacattggt cacatgaagg 305100 
gtgtttaggg gagaaatggg atggagagat 305160 
gaagaaatga atttcaagag atctattata 305220 
gattctattc ttgaaaaatg tagaatgaat 305280 
agtgcaaggt aatgcatttg tgaattagct 30534 0 
tttataacat catgttgttc atgataaaaa 3054 00 
tatctaggaa gaatctgtac agaagattgt 3054 60 
aagtcaccaa atttgtgaat aacattattg 305520 
taatagaata aattataatt aaatatttaa 305580 
aaatataggt gagatgctag tttgcttacc 305640 
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ctaaatatct 
acttccatca 
atgtgtgtat 
cattagtgat 
tataagcaaa 
gaagcagtac 
aaccagtaag 
ccctcccaac 
aagtgtgttt 
agatacagaa 
tgatttgact 
gttgactctg 
ttgagaaagc 
ttgcatagta 
aagttactca 
actcaaagct 
cgaagcaatg 
fcacagcatgt 
ggaacacagt 
tgcatcaccg 
tcctcaccca 
gaagtatttt 
gtttccccca 
tatttgttag 
tgtgatctcg 
ctcccgagta 
taatagagat 
ccgcccacct 
atctgatctg 
tcgctgtgtt 
gtggcctccc 
ccagtcttga 
tcaaggaaag 
aaacgtggct 
tcaagttgtt 
gaaatcttta 
ttaatgttct 
cctggaaaac 
acgatatata 
gagaatgcaa 
tatccattca 
cccccgaacc 
atacagatta 
tggttttctt 
ccttgacatt 
ttaatttagt 
ccttaattaa 
gtatgtggtg 
attctccaat 
tattgatgtg 
tatatgaata 
catgtgcact 
tgaatatgga 
cacacatagc 
aacattggaa 
agctggaatt 
agtctctttc 
aggaagatgt 
tcattgtccc 
gtcttattgg 
aagaattttt 



attttctgcc 
ccaaaattgg 
ttttagcaag 
gcaactgact 
atgacatatc 
tggctaaate 
gtagagttcc 
tagaattctc 
tattagcttg 
catatgaaga 
gttattccct 
ttccaatcct 
atgacctatg 
caagcctcac 
tctctggaga 
agcacttgca 
aagtaatagt 
taggatgggg 
gcctgacaat 
caagtacctt 
aatctcatct 
tccccacatg 
tgccgttctc 
tttgttttga 
gctcactgca 
gctgagacta 
cgggtttcac 
cagcctctca 
atggttttat 
aagaaggttc 
cagccacgtg 
gggtatttct 
aaaggtacat 
tgatcatgtt 
ctagaaattc 
cagtgaaggt 
agatgtattt 
atttagcctg 
ttttgaaaaa 
ctaatctcaa 
taaaattgat 
acaaaaatgt 
agtaaacatt 
actgggttca 
ttcactgtct 
tttacaatct 
aataaacttt 
atgagtagtg 
tattagctat 
gaaaaagggg 
tgtaaaaatg 
ttgtgtgagg 
cacaaataca 
cctttgctaa 
ggccatgtag 
ctattctact 
tttttcatgt 
atgaaagcac 
ttaacagatg 
atgaaatgat 
gcttcagctc 



ccagaaacaa 
caattaagaa 
aaagccagtt 
tctgcacacc 
attacacata 
-catcagtgta 
cgctccttga 
aaatgtgttt 
ggattccatc 
atccctgagt 
tcactcactg 
cagagcaatt 
aaaatgagtt 
tgatgatcac 
ttgtatccca 
ttagtattag 
agaaaattcc 
accacacctc 
aagtatgtgt 
tagaccccaa 
tgaattgtaa 
tggaagaagg 
gtgatagtga 
gacacagtct 
acctctgcct 
caggtgtgtg 
cgtgttagcc 
aagtgctgag 
aagcatctgt 
ttgcttcccc 
gaactgtgag 
ttatagcagt 
tgtagaatat 
tgttcattga 
caactattat 
gaatctggta 
gataagaacc 
tacaaagatg 
aacatttagt 
aattttaaat 
tgcaatatgt 
gtttatttat 
ttagtaatga 
gaaagagaag 
tgctgcaggg 
cattgtatct 
tggtatttgg 
catgaacttt 
gtcagttagg 
caataaatcc 
actaggctag 
gagtgtaata 
gatacatctt 
aatgtgatca 
cttaagatta 
tgtttcctgt 
aaatgtacat 
tgtcacaatg 
tctttaaaaa 
aataactgta 
cttgaccttg 



tcctttaata 
tattttcaca 
ttttgagctg 
atcctacaga 
ataactcatt 
gtattggctc 
tataagaata 
gtcggtggct 
tgtgaagact 
tcctacatag 
gaaaaatagc 
cttgccaaac 
gaagctccaa 
tatccaccat 
gggtctagca 
ctgagaattt 
tcttcatgtg 
ctttttactc 
tgagtaaatg 
agagcataat 
tccccagttt 
gacctaatgg 
gttcttacaa 
tgctctgtca 
cccgagttca 
ccaccacgcc 
aaggcggtgt 
atgacaggtg 
catttcccct 
tttatctttt 
tcaattaaac 
gtgaaaacgg 
caaaatatct 
gaacccactc 
tgaggcattt 
ataatacgta 
ttattctcag 
tatatttgtt 
ctcctaaaat 
aaataaccag 
tagtgctggc 
ttattcattt 
gcttacagat 
tatttttcag 
agttcaggag 
ggcaatttaa 
atagcaggat 
ggaattcaca 
gcaaattgcc 
tatcattcaa 
tccctgacta 
tccaatttct 
taccatttac 
tggaattagg 
aaaacataga 
cctatcttga 
atccatgtca 
accaataaaa 
tatattaaat 
cacatgggtg 
aggttgaaat 



gaacacttta 
cagaaaaaac 
tatgatggag 
agaagaaact 
gtggtgggtt 
tctattctac 
aggattccac 
catccaatat 
ggcatttcta 
ctgtcaagaa 
tcttcccaga 
ttcttttgag 
accagcccat 
atcttagaga 
caagtctcta 
tttaaaaaaa 
aaattcctgc 
acccattata 
aaccgctgtg. 
acagatatgg 
tcttactggg 
gtaactgaat 
gatctgatgg 
cccaggttgg 
agcgattctc 
cagctaattt 
cgatctcctg 
tgagccacca 
gcttgcactt 
gccatgatta 
ctcttttctt 
actaatagac 
cagtaattct 
tattggcttt 
gcgtattaaa 
acaacattat 
tattctgagt 
atatttttct 
tataaaaaaa 
tgcttatagt 
agcttagtat 
taaaactact 
tttatacgat 
tggggattgc 
catgtataag 
gacatagaaa 
aatgaatacc 
gaccttggct 
ttaattcttt 
agttgtaatg 
tgggacatag 
gtattatcat 
agactacttt 
atttcataaa 
atttgcagac 
gcaagccact 
tagatttgtg 
ggagaggatt 
ttgaaacatt 
aagattagtg 
ttaagcctcc 



cctttcctta 
cactattgat 
ccttttcact 
catgttcgta 
ctaagcaagt 
ctgttcttaa 
cttctcctgc 
tgcccaagga 
tcttactaac 
gtaaatgaac 
atcaagtatt 
aggtgctgcc 
accacgtgat 
tgggatttag 
cacaagtggt 
aatcggagaa 
taacatgttg 
gcccaacgat 
atcctcccac 
tttggctatg 
ttcagaaaga 
aatgagggca 
ttttgtttgt 
agggcagtgg 
ctgtctcagc 
tatgtattct 
tcctcgtgat 
tgcccggctg 
ctctctcctg 
taagtttctt 
tataaaataa 
aaagaaagcc 
ttttcatagg 
gtatcttgct 
ttacttttta 
gctaaccact 
aaactcgggg 
taattgaaag 
tcattgttta 
gtagttgtat 
aacacatttt 
ttatagtaat 
catgcggttt 
ctagctgcaa 
actattctat 
agtaactttt 
cagacaggca 
tagatttcag 
aaactcaatg 
agcagtaaag 
catttatttc 
actgcaacta 
tccatgtaca 
attgaattat 
agacacaatg 
caacattttc 
atgtctatta 
caataaatgg 
gtttcagagt 
attggaagtc 
tcatctagtt 



305700 
305760 
305820 
305880 
305940 
306000 
306060 
306120 
306180 
306240 
306300 
306360 
306420 
306480 
306540 
306600 
306660 
306720 
306780 
306840 
3.06900 
306960 
307020 
307080 
307140 
307200 
307260 
307320 
307380 
307440 
307500 
307560 
307620 
307680 
307740 
307800 
307 8 60 
307920 
307980 
308040 
308100 
308160 
308220 
308280 
308340 
308400 
308460 
308520 
308580 
308640 
308700 
308760 
308820 
308880 
308940 
309000 
309060 
309120 
309180 
309240 
309300 
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tttttaaaaa acgtttctaa cagaaatata ctttgtggat cattatacct cattctaatc 309360 
cagctccctc aagcttaatc tgtatgaact ttggggaaaa gataagcact gtgaataagt 309420 
tggcatgctt gaacattctt tgaataagta gaaaatattg ctctttcatt ttattgacag 309480 
aaagattatc ttgatagctt atacttctgt ttttaatgta aataataggt atcaaattct 309540 
aaaattgaat aatgttgacc aatttaatga gtgtctgtaa taaggggaat ttataaatgt 309600 
tttgcactaa ttgcagtttg" taaagttatt gaacttcctt ttatacttag atgataattt 309660 
attatttatt tatttatttt tgagatggag tcccactctg ctgcccagag tggagtgcag 309720 
tggcccgatc tcagctcact gcaacctctg cctcccaggt tcaagtgatc cttctgcttc 309780 
agtctcctga gtatccggga ttacaggcac gtgccaacac gcccagctac ttttttgtat 309840 
ttttagtaga gatgaggttt cactatgtta gccaggatgg tctccatctc ttgacctcat 309900 
gatctgccca cctcggcctc ccaaagtgct gggattacag gcgtgagtca ctgcgcccgg 309960 
accaattttt tatttttatt gacaatgctg ctttggataa agaaagttac tgaacgcctt 310020 
ggtcaatgca actgtggtaa agagaacaga aatgatcatg cacaatagca agatttcagt 310080 
taaggttctt ggttgcaagt gataaaaatt taaagcagga aaataatcta ttagaaggct 310140 
atggaataga tgtcagagac aaatgttaag gttctgaaat tgggeaataa gcaaggaagg 310200 
aaagacctca aaggcattaa cctgcagtgc ttctgaacgt gggctctgga gctgaagttc 3102 60 
caaatcttcg ctccaccact taatagaact ttgtgacttc gggcaagtga cttaaccttt 310320 
gtatgtccca gtattcagat cttcaaaatt agactaccct tacagctaac tgagttaaca 310380 
cgtgaaagac acataataga tcttcaaaaa tgtccagaaa aaaataaatg tggccagaac 3104 40 
aatggccatc acattataca ccatggactc cactggacgt ggttatggaa aactaatctc 310500 
gctgtccttt tgaatttgca taectccaca aagttcctga cctatattgg attttgcctt 310560 
ggttgagagg agaacagaag ggatgaaatt tctttacttt ttctctaagt gaaaggtata 310620 
attccctgaa actaagaaga ggagggattt ggatgttaaa cagccacaca ttttgaaaat 310680 
gtccactaca gttagaataa ccatatatcc tagctacctt gagaaagtcc cagtttacaa 3107 40 
ctgatatccc acatggttaa acacaccttc gtggcagaag tgtcacaatt tggatgataa 310800 
attatggtca ctgtgattgt cattcactcc cttttgctgc taaaatgtgc atatattctt 310860 
cttcttatac ttactggtcc agtgraaaaaa gaccttaata acaaaattaa attttcattt 310920 
ataaaaacaa taatgcattt attcttttca gcaaacaaaa caaaaacaca acaacaacaa 310980 
caacaaaaaa ccagaaagtc caaaatacta ttattctgct atctctaagt cttggatatc 31104 0 
tgtggactat tttttgactc taattttttc taatcctatc tcagtatttc atcacctaca 311100 
gagtaaatta gcatcgtccc cacaacaata ccctacatac ggtatctata tctatttata 311160 
tacatgtata caaatatata cataaatata tatatgcata tatacatatc tatatgtata 311220 
tagatagatg tagatatatg aaaggtatat ctatatacag taggcctcca ttacctcaga 311280 
gacacatctc aagaccctca gtggatgcct gaaaacgtgg atagtgccga accctatata 31134 0 
tactatgtat aatttttttc tttgtcacca tttcacagat aaaagattca ttttgctgta 311400 
tatcttagca acctcagcat atgatttttt tccctttctt tattaagtca agaacttgca 3114 60 
tcattttcct taatggaagc actttacagc ttctgtttgg catagccaaa tggccagtat 311520 
cactgctttt gtgctttgag gccattatta agtcaaataa ggatgacttg gacataagca 311580 
ctgaaatagt caatctgata gctggggtgg ctagtgagtg actaatgggt ggggagcaga 311640 
gacagcctgg agaccctgaa caaatgaatg attcatcccg agtttgatgg agaaagagca 311700 
caaaacttca tcaccatact cagaacagca cacaatttaa aacttatgaa ttgtttattc 311760 
ctggaatttt ccacttaata tttttggacc atggtttact gcaggtcatt gaaacctaaa 311820 
aaagcaaaac tttgaataag gagaaactac tctgtgtgtc tacatgcaca tgtataagta 311880 
gtctctcttt cacacacata tacacataca ggaaaggaag agaagacatg tggattctag 31194 0 
tttcctcatt tccatacttg attatgaggc caaattgagt atttatgact tgcctccttc 312000 
atgacccagt acatactccc tctactttcr gggaacactt ctgtaatatg ctagaatttt 312060 
catgcctaaa gattctaagg gtttttcttt ttacatatag' aatgaaaaga gtctttcatt 312120 
aacttccaat taatgaacat gctggttctg tacaccatct accacccaga acgctgatct 312180 
ttcaagatgt tgtttcccat ctggcactat agtgagtttt ccatgaatca tccagtattc 31224 0 
cacaagggct acacagtaag aaccttgaat cctatgggaa taagctcaat gcctaacatc 312300 
gtttgcatta tattctttgt ttagtagcaa tgatatataa aagttgtggt tatagagttt 312360 
gttgaaaata tctgaagaag gtacagacag caaaggcaga cacagaagaa aaatatatca 312420 
attttttaac agcagctgtt ctactgaagg caaatcactg cactctccat gattcaagaa 312480 
atcaactacc acccccccaa cccccccctc accacacaca cacgcagtac tatttgctag 312540 
catcagtctt gctcctctat cagggcctct aggttgtgct cagtatgaat gaattagaca 312600 
ctcaggggtg ccattagctt ggcaacctgt ccattttacc atgtcctagg catgtttcta 312660 
tccataagac ctttgtcact tgacttaata aggagacaca cagtggccag gaaaagagat 312720 
atttcatcac agaacctgct gccatgttgc ttagaaggag aaggagctag atatgtgact 312780 
atgtgtccat tcttgagccc tggcacttgg gttgtcttga tggtcaggaa cttggaagga 312840 
agacaattga aatattgatg acaagaaaat ttggggaaga gacatgtggc tataactctc 312900 
agaatgaacc aaaacttgaa gatatttttg gatgtttaaa aaaggatgac ctcagaatag 312960 
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gaggatttta acaattaaaa gaataggatg acttgttttg tgggtgccag tcagcctctt 313020 
tactcagtcc tccctgtcac ttcccattgg gctcatgact aaaatagcca ttgtgtcagg 313080 
gatggaggtt acgcatgggc tcagctacat ggacttacac tcaccaaggc tgccttccta 313140 
ctatcagtct accagctgta gagaccaaca ctgtgcaccc aatgtgacac tattccccag 313200 
gggatcaggc atctacctgg tggcaggcag atcacagtgg acagcctcca tcatggaaag 313260 
ggaagtattt tgtttctact- ggataagatt ttggatgtgg atgtgtctgt cctgaacaca 313320 
atgcttctgc caaacacgga cttacagact gccttatgca ctgtcgcagt attcctcacg 313380 
gcatggcttc tgatcaagga actcacttca cagaaaaaga agagtgccaa tgggccatgc 3134 4 0 
tcatgggatt caccggtttt atcatgttcc ccaccgtcct gaagcagctg gcttgataga 313500 
atggtaaaat gaccttttga agacagctct agccccagct aggtaacttc acagggcggg 313560 
ggcgagggtc tctagatggc tgtaaatgct ctaaactaac tatccgatat atggtgtcat 313620 
taatttttat ccaaattcat cttcttcacc tgtgtattat caatagaaaa tagcttaatt 313680 
tgtcaatcta tcttgggtca aagtttacca taaagctttt catgaatttt agtttcaaaa 31374 0 
atttacatct tatgtaatag tgtgtataca tttatatagt aagaaataat atagtaaaga 313800 
aaatcttaaa cacatggcta agtttggcag aatgtcataa agatcacatt gtacaacaaa 3138 60 
tgccagaaat tatagtgctg tccataattt tgtttctttg agactgattc cactggtttt 313920 
caaatattgc tacagttgaa acaaaattaa aagaataaaa actcagaatg atcacataga 313980 
gtggctaaat taatgatatt caagttctgt ttcttcatct. ttataatcac ggtgttataa 314040 
ttgtttattt caaatcaacc gctcaaacac aggaatccat agcaccaaaa atttttgata 314100 
cagaaacttc actgagggca agcctaaatg atcccaaatc tctatgaacc ttttcttgaa 314160 
atgaatgcat gtaactgagt ccgtgtttgt caaggccatt tgtaaaactg gaagttctct 314220 
aatttaacat ccatgtaaaa tacaatattt ataaaacaac ttgtaataaa tgtatgttaa 314280 
tttgatgtta catatatatg tatatatnca attacatata tatatatata attaaattca 31434 0 
agagaaacca agtgattgtt taggactgag accaataaaa tatttaataa aatatttttg 3144 00 
gagggaggac ttgtaatcac gaactggggg gaatatccac gggcaaacca tgaaccagat 314 4 60 
tcaggacaat gtcgttgtcc tctgaatccc aggtaagtcc actttgactc ttaaatggca 314 520 
gaaggactct gggttttcaa aacccataag cctgttatct aacgtatttt cgtttcacca 314580 
gtgcacagct cctcttgttt atggatgcag atgcctttcc agaaggctag aaattataga 314 64 0 
aataacattt ggtattttca gagtctgtat atggaagtac ctacctacca tgtttttatt 314700 
taaagaattt tgaatatgtg atttgactca aatttgggaa ttgtgtgagg aacaaaacta 314760 
ttttggtggc ttcaattagg tctatatgca atagaacgtg aatacttttc aaaaacagaa 314820 
atagaacagt gcctacgtga gctagcacca attatggtcc ttggtactcc atggcctatt 314880 
tacagctaca ttaatccatg agtcttataa ttcaaatggc tccagtggct ttgatgttca 314940 
ttagggacac aacagccctt tgccttccct ggagaagtgc tgtgttttgg gttctaacaa 315000 
cataaatact ccccagtccc agtgttttta tggagcccaa ttcaatgtac gatctccccg 315.0 60 
tggcatcacc tacctataat gattaaggat ttgggagatt catttagtga attttatcaa 315120 
gataacaatc ttaacaaaca ctgtagcata tgcacactcc aaaggctttc cctctttcta 315180 
cgttaaatcc aatggattcc tgggactgtg tctcttatag gcttttactg ggcctaaatt 315240 
ttagccaata agcttgtcaa acagctagaa acacattaaa ctcttcaagc ttacctactc 315300 
tat'ccatgct cttcgggtaa gtgtccactt tgaaattcac cccaatgcaa tcttttatct 315360 
tggttggatg cacaccttca gaacctattc ctaagaaatt cttgcatata ttgttggcat 315420 
aaatgtcttg caattctgtt gttatatcat tttttttctg agtatatttg acccctattt 315480 
aatgcttagc tcttgctcaa gttctaggac tgcaggcaat gaagactggg taagtagggg 315540 
ttaggatgag aattatcttc tcagtctgag gtaacttcct catatgtaaa ggctactaag 315600 
gcaggtttgg aaatgggttt atatatgcat ttccgaatat cctaataata tttggctgct 315660 
tacattagta aaaatcatta aagttgcttt cactcaaaga ctgtaagata taaagaaaag 315720 
ttatttaaaa tgtaaagcat tgaacagaaa acaatcggaa actaatcaat gaaatgggta 315780 
tatataggag gattccaaaa attcggagac tctaaccagt gaatccagcc ttgctagtac 31584 0 
atccctgcag ccagatacag ttgcacttgc tggaccacta caacatcggg tgccaatcta 315900 
accactgaat ccaacctcct tgtacatcac tgcaaccaga tgcagttgca cttgctggac 315960 
cactacaaca tcgggtgcca atttaatact ttcttctttt cactctctac cccrgtttca 316020 
gtatccttgg cagtatgcta tgttcaactg actttgagat gtatgattct attttgacac 316080 
caacaggctt agaaaagagt aaactgaaca ctctggcttt tgtagggaga agtagaactc 31614 0 
cgtgtagcac gaagtcttat gcaaaagtag gaagcagtgt aggaggattg gcagaccaaa 316200 
aaatgaaaag tgacaaatat ttatatacag ccagataatt ttgttataag tcaatactct 316260 
agatttttta aatctcccat ctccaaggca atttaggcaa cagcaagtat cttataaaat 316320 
tctctttttt tagaaaatat agatgaatta tttaagatgg attatcatta tttattgtgt 316380 
atttagtata ggtttattta tctcacactt ctttgtcttt ttccattatt ttcagctgtt 3164 40 
caaaaattct gcatttgtaa caacttcctc aatatgtaaa tttaaaaatc cctcaatatt 316500 
tgccatctta ttctgaagtg tttaccatct tactctaaag taatctgaac accacatgca 316560 
ttattgttgg agttttaaaa cattagtatt tatctgatcc taatgtttta ttgtaattgt 316620 
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atagattttt tcactaattt tggaagggta aactattgtt tcatagatgc aataaatcct 316680 
ggaaacttgt ttgcaatact tttaaaagca gttttcaaaa tttctcctct atcacacagg 31674 0 
agacacactg taatgactac tatttcctct gatttctttt aggtgcgttt cttttgactg 316800 
tattttcatc agtctcatca tttttctgtg tcatccattt gcgctgggac ttgtgcctac 316860 
tgatggtttt caagtgtggc tgcaatttgg ttcccttgaa ggtattttaa aatatgctga 316.920 
tgcactagta ccattcccaa accaactgaa acacattctt caatggcaag gtcatggtgc 316980 
tggtatttaa aacaaaaatc ttcctaagtt attttaatat acaactacaa catattcact 317040 
ggtatagata atattgaata caaaaaaaac ctatatgatg actcgttcct tctgtgagag 317100 
agcaaagtcc acaatttaaa aaaattaata atactgttaa ctcagaggga tcagaccagg 3i7160 
tggaaagcac gaatatgcac tctgcataat gaagcttatt tccagtgtac aaatctacag 317220 
caagcagaaa aggatacttg gtgcagtttg ctctagatac acagctaaca cagcgagcca 317280 
tgcagtgggg cctccaccta gaggaactga ttacaggaag aggaagtttt gatttcaacc 317340 
ttgttaggct ttccttceag ttagtaatgg tttcaagctg aatggatgac acaacatgaa 317400 
aaattcaact acttaagaat agcatgcgcc aagtatattc tataactttc tgttaagatt 3174 60 
catttcactt aataattttc tatataaatc tctattaatt taagtttctg aataactgta 317520 
gggtcaacaa catatttggt actctcctac tttgagcttg gttgaaccac ataatttgaa 317580 
agaggatatt ttataaagtt aaaat;cctga taactttaaa gactgataaa tttctgtttg 317640 
cattcagaat atagtggcag aaaaaaatat tcctttctaa attataatga ttaaattgtt 317700 
ttgatggtat tatgttgcta taaataattt cctagagata ttaaggtata cttacataag 3177,60 
atttattcac atttatgata ttttaggctg accttaaaaa ttatttatga atttaattgc 317820 
ttatgcatat gtgataaatg ctaattataa tatttacttg gctttaataa atatttccag 317880 
ttgcaatttg ataggctaac tgtatgtaaa aatttcataa tattgtrctag aatagttttt 31794 0 
ctttattctg atccttttta gttatagtct tatagatatg gccttagttc aaaagcacta 318000 
tacactcgtg aaacatttct aagtcaatgt aaaatattgt aatgtgatgt cagaataaaa 318060 
tctttttgaa gatagagatt ttcatggttt ttttccaaat cagagtttaa agtagtcaat 318120 
ctcatgattt gtcagacttg aaaagttata gaatgaatat tttaacataa aaataactta 318180 
aatgaatatc cttagaatac aaatcaactg tatattcact cgaggcactt aaaatattct 318240 
atatgccaaa aactgcatat ctctttccaa tttttaatac tgctttctct ctctctctct 318300 
tagtttacaa atatttctca atatttaaac ttttagaact gaatcatgtt ggggaaatga 318360 
agtctttggg gaaataattt attatgaaaa atttttatgt agttattaaa aaaattctaa 318420 
ctcagttaaa caatattata tggaaatgcc atagtttgaa aaattgtcaa ggttaagata 318480 
aaacaaaaaa atatttttga cttttttttt cttttctttt attattatac tttaagtttt 318540 
agggtacatg tgcacattgt gcaggttatt tacatatgta tacatgggcc atgctggtgt 318600 
cctgcaccca ttaacttgtc atttagcatt aggtatatct cccaatgcta tccctccccc 318660 
ctccccccac cccacaacan gtccccagag tgtgatgttc cccttcctgt gtccacgtga 318720 
tctcattgtt ncaattecca cctatgagtg agaatatgcg gtgtttggtt ttttgttctt 318780 
gtgatagttt actgagaatg atgatttcca atttcatcca tgtccctaca aaggacatga 318840 
actcatcatt ttttatggct gcatagtatt ccatagtgta tatgtgccac attttcttaa 318900 
tccagtctat cattgttgga catttgggtt ggttccaagt ctttgctatt gtgaataatg 318960 
ccacaataaa catacgtgtg catgtgtctt tacaacaaca tgatttatag tcctttgggt 319020 
atatacccag taatgggatg gctgggtoaa atggtatttc tagttctaga tccctgagga 319080 
ategccacac tgacttccac aatggttgaa ctagtttaca gtcccaccaa cagtgtaaaa 319140 
gtgttcctat ttctccacat cctctccagc acctgttgtt tcctgacttt ttaatgattg 319200 
ccattctaac tggtgtgaga tggtatctca iitgtggtttt gatttgcatt tctctgatgg 3192 60 
ccagtgatga tgagcatttt ttcatgtgtc ttttggctgc ataaatgtct tcttttgaga 319320 
agtgtctgtt catatccttt gcccactttt tcatggggtt gtctgttttt ttcttgtaaa 319380 
tttgtttgag ttcattgtag attctggata ttagcccttt gtcagatgag taggttgcga 319440 
aaattttctc ccattttgta ggttccatgt tcactctgat ggtagtttct tttgctgtgc 319500 
agaagctctt taatttagtt agatcccatt tgtcaatttt gtcttttgtt gccattgctt 319560 
ttggtattta gacgtgaagt ccttgcccat gcctatgtcc tgaatggt 319608 

<210> 2 
<211> 2226 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'0TR 
<222> 1- .95 

<220> 
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<221> CDS 
<222> 96. .167 

<220> 

<221> 3 ? UTR 
<222> 168* .2226 

<220> 

<221> polyA_signal 
<222> 2200. .2205 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 2 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggectc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctgaac attcaccctg ggagctctgt 277 

gctgccatgc aaatgtcaca ttgeactgag gccgccatgc tgtaaaaaac cccaagctac 337 

ctacctgaaa agctggagag aatcagcacc catcagcttt tccagcattt tgggtcctcc 397 

cagctaaatc ateggcattg tgaagcagag atgagacctc ctgccatgcc ctttccaaat 4 57 

cctggaccaa cagaatcatc atatacacta ggttt'tgtgc atctctaaga tgatcatctt 517 

ggtgtggggt cctgatcttc ccagtgactt agcagcacag taatttggct gaaatgttga 577 

tgagttcaga caagaagaag gtgatgatga caagagagcg teaeggggaa attctgccct 637 

gaggaaggaa tagtttgagg ctggagagat gtgctacccc cgccactctt tctcaaatgg 697 

aaagacaacc tgtaggcaca gaaatcattg ctctgcttac actgaaatct cctgctgact 757 

ttgagagctg agcattggat tttgatggga gagtgaaact tagtgaagaa atgatttgaa 817 

cctttgttga tctaatactt ctgtgataca gttatctaaa cagaatgagt ctaaagtcag 877 

tataaagacc tgtgtctctc catgttttgc tgcttataat tgcagaggaa cattttgaga 937 

tagaaaattt tatccctctt tcaagtgact ttttataaaa cataaccttt taagagaggt 997 

cattcttttg aaaaattgtc tgaatacaaa aaagtaactg aacagcaagg gattgtagee 1057 

ataatgttct tgaccacttt agattgacaa atagtatctc aaaattgatc aggttctgga 1117 

gaatctattg aggttgtatg gagtaaccaa gtgatggaag aggttggaag tgaataattt 1177 

tatgtggtat tatgtgattt tttttggtgg gggtagtgtc tggagctgag catgatggag 1237 

cctagatggg gcagccttct aatgctccca gtatttcgea aatcatatgt caegtcaaaa 1297 

ttgtaatgac aaagctcaca gcagctgcct aegecaggtt tggtgccagc tctaattaga 1357 

gcattgatca tttctctccc cttcatgaat gcacatgagt gcacattttc atatataagg 1417 

gcaaaggaca tttcaaagtt tggatggaat ctttcctact tagaagtaat aaaaaacaat 14 77 

actttgaaat cttaccaaat gttaaacaaa tagcaacaat ttttccattt gaagatgtgt 1537 

gtcttaaggt ctgttggatc tttcaaattt ctattttgga catggaaact acagatgaaa 1597 

tttaacttac tcaaagttac acagaactcc tggctgatct cttttactta attcttctaa 1657 

aatctaaagt aagaataatt ttgaaatttt attggtaatg cttaaataac aaaagttact 1717 

ttaatatgtt acatgtttga agtttcattt atacttaata tctaatttaa cgcttgtgaa 1777 

aatattattc agtatttttt tttcttttct aaaatgactt gttcagggct ccttttggga 1837 

ttttaagtga gaaaaactct atttcataat attgttacca tgctcatcca ataacacttc 1897 

atagatggtt agccatctct aaaattaaat tttgtagttg aggagcattg aaaatatcac 1957 

ctatatttgg atttctgttt aacacttcaa aatgcatttt atcataattt aaaatttgat 2017 
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ttaaatacaa acctctttat agaaaacctg ataaatgtag ttttatttgt ttgcttgctt 2077 

ttttttgcag ggagggatat tcccttgata ccactctgtc tttatgactc atcggttatt 2137 

tataaatgtt atcatcgttt ttctgatttt taaataagtt tcagaattaa tacatttaac 2197 

tgaataaatc aaccaataat gtcaaactt 2226 

<210> 3 
<211> 2431 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5 * GTR * 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3 1 DTR 
<222> 168. .2431 

<220> 

<221> polyA_signal 
<222> 2405..2410 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-28 6 : polymorphic base A or G 
<220> 

<221> allele 
<222> 328 

<223> 8-155-258 : polymorphic base C or T 



<400> 3 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get: get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 , 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtccttgaa ggtctccctg ctgtgcttag 277 

agagggggat gctggaagaa gagtggtcag agaagcaaca ttgetgaetc yaaagacaga 337 

gggcaggeca egagecaagg catgeaggag gcctccagaa gctggggcag gcaagagaat 397 

ggactcagcc ctggagcctc cagaaggcat gcagccctgc ccattccttg ettttageca 457 

ggaacattca ccctgggagc tctgtgctgc catgeaaatg teacattgea ctgaggccgc 517 

catgctgtaa aaaaccccaa gctacctacc tgaaaagctg gagagaatca gcacccatca 577 

gcttttccag cattttgggt cctcccagct aaatcategg cattgtgaag cagagatgag 637 

acctcctgcc atgccctttc caaatcctgg accaacagaa tcatcatata cactaggttt 697 

tgtgcatctc taagatgatc atcttggtgt ggggtcctga tcttcccagt gacttagcag 757 

cacagtaatt tggctgaaat gttgatgagt tcagacaaga agaaggtgat gatgacaaga 817 
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gagcgtcacg gggaaattct gccctgagga 
acccccgcca ctctttctca aatggaaaga 
cttacactga aatctcctgc tgactttgag 
aaacttagtg aagaaatgat ttgaaccttt 
ctaaacagaa tgagtctaaa gtcagtataa 
ataattgcag aggaacattt tgagatagaa 
taaaacataa ccttttaaga gaggtcattc 
aactgaacag caagggattg tagccataat 
atctcaaaat tgatcaggtt ctggagaatc 
ggaagaggtt ggaagtgaat aattttatgt 
gtgtctggag ctgagcatga tggagcctag 
tcgcaaatca tatgtcacgt caaaattgta 
aggtttggtg ccagctctaa ttagagcatt 
tgagtgcaca ttttcatata taagggcaaa 
ctacttagaa gtaataaaaa acaatactfct 
acaatttttc catttgaaga tgtgtgtcfct 
ttggacatgg aaactacaga tgaaatttaa 
gatctctttt acttaattct tctaaaatct 
taatgcttaa ataacaaaag ttactttaat 
taatatctaa tttaacgctt gtgaaaatat 
gacttgttca gggctccttt tgggafcttfca 
taccatgctc atccaataac acttcataga 
agttgaggag cattgaaaat atcacctata 
attttatcat aatttaaaat ttgatttaaa 
tgtagtttta tttgtttgct tgcttttttt 
ctgtctttat gactcatcgg ttatttataa 
aagtttcaga attaatacat ttaactgaat 

<210> 4 

<211> 2215 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 r UTR 
<222> 1. .95 



aggaatagtt tgaggctgga gagatgtgct 877 

caacctgtag gcacagaaat cattgctctg 937 

agctgagcat tggattttga tgggagagtg 997 

gttgatctaa tacttctgtg atacagttat 1057 

agacctgtgt ctctccatgt tttgctgctt 1117 

aattttatcc ctctttcaag tgacttttta 1177 

ttttgaaaaa ttgtctgaat acaaaaaagt 1237 

gttcttgacc actttagatt gacaaatagt 1297 

tattgaggtt gtatggagta accaagtgat 1357 

ggtattatgt gatttttttt ggtgggggta 1417 

atggggcagc cttctaatgc tcccagtatt 1477 

atgacaaagc tcacagcagc tgcctacgcc 1537 

gatcatttct ctccccttca tgaatgcaca 1597 

ggacatttca aagtttggat ggaatctttc 1657 

gaaatcttac caaatgttaa acaaatagca 1717 

aaggtctgtt ggatctttca aatttctatt 1777 

cttactcaaa gttacacaga actcctggct 1837 

aaagtaagaa taattttgaa attttattgg 1897 

atgttacatg tttgaagttt catttatact 1957 

tattcagtat ttttttttct tttctaaaat 2017 

agtgagaaaa actctatttc ataatattgt 2077 

tggttagcca tctctaaaat taaattttgt 2137 

tttggatttc tgtttaacac ttcaaaatgc 2197 

tacaaacctc tttatagaaa acctgataaa 2257 

tgcagggagg gatattccct tgataccact 2317 

atgttatcat cgtttttctg atttttaaat 2377 

aaatcaacca ataatgtcaa actt 2431 



<220> 
<221> CDS 
<222> 96. .311 

<220> 

<221> 3 1 UTR 
<222> 312. .2215 



<220> 

<221> polyA_signal 
<222> 2189. .2194 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 4 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
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aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 

Met Ala Ala Tyr Phe Phe 
1 5 
aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aa'a ggt 
Lys Pro Val Arg Arg Ser Xaa Leu Ser .Trp Pro Gly Leu Val Lys Gly 



10 15 
aac etc ate acc gga gtc aag tgt cat 



20 

cac ctt tec cat gtt cea ttg 



Asn Leu He Thr Gly Val Lys Cys His His Leu Ser His Val Pro Leu 

25 30 35 

gtt aga agt ccc age tec tgc cca cat tea agg gaa- cat tea ccc tgg 
Val Arg Ser' Pro Ser Ser Cys Pro His Ser Arg Glu His Ser Pro Trp 

40 45 50 

gag etc tgt get gcc atg caa atg tea cat tgc act gag gcc gcc atg 
Glu Leu Cys Ala Ala Met Gin Met Ser His Cys Thr Glu Ala Ala Met 
55 60 65 70 

ctg taa aaaaccccaa gctacctacc tgaaaagctg gagagaatca gcacccatca 
Leu * 

gcttttccag cattttgggt cctcccagct aaatcategg cattgtgaag cagagatgag 
acctcctgcc atgccctttc caaatcctgg accaacagaa tcatcatata cactaggttt 
tgtgcatctc taagatgatc atcttggtgt ggggtcctga tcttcccagt gacttagcag 
cacagtaatt tggctgaaat gttgatgagt tcagacaaga agaaggtgat gatgacaaga 
gagegtcacg gggaaattct gecctgagga aggaatagtt tgaggctgga gagatgtgct 
acccccgcca' ctctttctca aatggaaaga caacctgtag gcacagaaat cattgetctg 
cttacactga aatctcetgc tgactttgag agctgagcat tggattttga tgggagagtg 
aaacttagtg aagaaatgat ttgaaccttt gttgatctaa tacttctgtg atacagttat 
ctaaacagaa tgagtctaaa gtcagtataa agacctgtgt ctctccatgt tttgetgett 
ataattgeag aggaacattt tgagatagaa aattttatcc ctctttcaag tgacttttta 
taaaaeataa ccttttaaga gaggtcattc ttttgaaaaa ttgtctgaat acaaaaaagt 
aactgaacag caagggattg tagecataat gttcttgacc actttagatt gacaaatagt 
atctcaaaat tgatcaggtt ctggagaatc tattgaggtt gtatggagta accaagtgat 
ggaagaggtt ggaagtgaat aattttatgt ggtattatgt gatttttttt ggtgggggta * 
gtgtctggag ctgagcatga tggagectag atggggcagc cttctaatge tcccagtatt 
tegcaaatea tatgtcacgt caaaat.tgta atgacaaagc tcacagcagc tgcctacgcc 
aggtttggtg ccagctctaa ttagagcatt gatcatttct ctccccttca tgaatgeaca 
tgagtgcaca ttttcatata taagggcaaa ggacatttca aagtttggat ggaatctttc 
ctacttagaa gtaataaaaa acaatacttt gaaatcttac caaatgttaa acaaatagca 
acaatttttc catttgaaga tgtgtgtctt aaggtctgtt ggatctttca aatttctatt 
ttggacatgg aaactacaga tgaaatttaa cttactcaaa gttacacaga actcctggct 
gatctctttt acttaattct tctaaaatct aaagtaagaa taattttgaa attttattgg 
taatgcttaa ataacaaaag ttactttaat atgttacatg tttgaagttt catttatact 
taatatctaa tttaacgett gtgaaaatat tattcagtat ttttttttct tttctaaaat 
gacttgttca gggctccttt tgggatttta agtgagaaaa actctatttc ataatattgt 
taccatgctc atccaataac acttcataga tggttagcca tctctaaaat taaattttgt 
agttgaggag cattgaaaat atcacctata tttggatttc tgtttaacac ttcaaaatgc 
attttatcat aatttaaaat ttgatttaaa tacaaacctc tttatagaaa acctgataaa 
tgtagtttta tttgtttgct tgettttttt tgcagggagg gatattccct tgataccact 
ctgtctttat gaetcategg ttatttataa atgttatcat cgtttttctg atttttaaat 
aagtttcaga attaatacat ttaactgaat aaatcaacca ataatgtcaa actt 

<210> 5 

<211> 2132 

<212> DMA 

<213> Homo sapiens 



113 



161 



209 



257 



305 



361 

421 
481 
541 
601 
661 
721 
781 
841 
901 
961 
1021 
1081 
1141 
1201 
1261 
1321 
1381 
1441 
1501 
1561 
1621 
1681 
1741 
1801 
1861 
1921 
1981 
2041 
2101 
2161 
2215 



<220> 

<221> 5'UTR 
<222> 1. . 95 



<220> 
<221> CDS 
<222> 96. .212 
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<220> 

<221> 3,'UTR 
<222> 213. .2132 

<220> 

<221> polyA_signal 
<222> 2106. .2111 



<220> , . 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 



<400> 5 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttp ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aca ttc ace ctg gga get ctg tgc tgc cat gca aat gtc aca ttg cac 209 
Thr Phe Thr Leu Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 

25 30 35 

tga ggccgceatg ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga .262 
* 

gaatcagcac ccatcagctt ttccagcatt ttgggtcctc ccacfctaaat categgcatt 322 

gtgaagcaga gatgagacct cctgccatgc cctttccaaa tcctggacca acagaatcat 382 

catatacact aggttttgtg catctctaag atgatcatct tggtgtgggg tcctgatctt 442 

cccagtgact tagcagcaca gtaatttggc tgaaatgttg atgagttcag acaagaagaa 502 

ggt gat gat g acaagagagc gtcacgggga aattctgccc tgaggaagga atagtttgag 562 

gctggagaga tgtgctaccc ccgccactct ttctcaaatg gaaagacaac ctgtaggcac 622 

agaaatcatt "getctgetta cactgaaatc tcctgctgac tttgagagct gagcattgga 682 

ttttgatggg agagtgaaac ttagtgaaga aatgatttga acctttgttg atctaatact 742 

tctgtgatae agttatctaa acagaatgag tctaaagtca gtataaagac ctgtgtctct 802 

ccatgttttg ctgcttataa ttgcagagga acattttgag atagaaaatt ttatccctct 862 

ttcaagtgac tttttataaa acataacctt ttaagagagg tcattctttt gaaaaattgt 922 

ctgaatacaa aaaagtaact gaacagcaag ggattgtagc cataatgttc ttgaccactt 982 

tagattgaca aatagtatct caaaattgat caggttctgg agaatctatt gaggttgtat 1042 

ggagtaacca agtgatggaa gaggttggaa gtgaataatt ttatgtggta ttatgtgatt 1102 

ttttttggtg ggggtagtgt ctggagctga gcatgatgga gectagatgg ggcagccttc 1162 

taatgctccc agtatttege aaatcatatg teaegtcaaa attgtaatga caaagctcac 1222 

agcagctgcc tacgecaggt ttggtgccag ctctaattag agcattgatc atttctctcc 1282 

ccttcatgaa tgcacatgag tgcacatttt catatataag ggcaaaggac atttcaaagt 1342 

ttggatggaa tctttcctac ttagaagtaa taaaaaacaa tactttgaaa tcttaccaaa 1402 

tgttaaacaa atagcaacaa tttttccatt tgaagatgtg tgtcttaagg tctgttggat 14 62 

ctttcaaatt tctattttgg acatggaaac tacagatgaa atttaactta ctcaaagtta 1522 

cacagaactc ctggctgatc tcttttactt aattcttcta aaatctaaag taagaataat 1582 

tttgaaattt tattggtaat gcttaaataa caaaagttac tttaatatgt tacatgtttg 1642 

aagtttcatt tatacttaat atctaattta acgcttgtga aaatattatt cagtattttfc 1702 

ttttcttttc taaaatgact tgttcagggc tccttttggg attttaagtg agaaaaactc 1762 

tatttcataa tattgttacc atgctcatcc aataacactt catagatggt tagccatctc 1822 

taaaattaaa ttttgtagtt gaggagcatt gaaaatatca cctatatttg gatttctgtt 1882 

taacacttca aaatgcattt tatcataatt taaaatttga tttaaataca aacctcttta 1942 

tagaaaacct gataaatgta gttttatttg tttgettget tttttttgea gggagggata 2002 
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ttcccttgat accactctgt ctttatgact catcggttat ttataaatgt tatcatcgtt 2062 
tttctgattt ttaaataagt ttcagaatta atacatttaa ctgaataaat caaccaataa 2122 
tgtcaaactt 2132 



<210> 6 

<211> 1981 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 



<220> 
<221> CDS 
<222> 96. .260 



<220> 

<221> 3 1 0TR 
<222> 261.. 1981 

<220> 

<221> polyA__signal 
<222> 1955.. 1960 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 ; polymorphic base A or G 



<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 



<400> 6 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa ggt 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aac etc ate acc gga gtc aag tgt cat cac ctt tec cat gtt cca ttg 209 
Asn Leu He Thr Gly Val Lys Gys His His Leu Ser His Val Pro Leu 

25 30 35 

gtt aga agt ccc age tec tgc cca cat tea agg ggt ttt gtg cat etc 257 
Val Arg Ser Pro Ser Ser Cys Pro His Ser Arg Gly Phe Val His Leu 

40 45 50 

taa gatgatcatc ttggtgtggg gtcctgatct tcccagtgac ttagcagcac 310 

55 

agtaatttgg ctgaaatgtt gatgagttca gacaagaaga 
cgtcacgggg aaattctgee ctgaggaagg aatagtttga 
cccgccaetc tttctcaaat ggaaagacaa cctgtaggca 
acactgaaat ctcctgctga ctttgagagc tgagcattgg 
cttagtgaag aaatgatttg aacctttgtt gatctaatac 
aacagaatga gtctaaagtc agtataaaga cctgtgtctc 
attgeagagg aacattttga gatagaaaat tttatccctc 
aacataacct tttaagagag gtcattcttt tgaaaaattg 
tgaacagcaa gggattgtag ccataatgtt cttgaccact 



aggtgatgat gacaagagag 370 

ggctggagag atgtgctacc 430 

cagaaatcat tgetctgett 490 

attttgatgg gagagtgaaa 550 

ttctgtgata cagttatcta 610 

tccatgtttt getgettata 670 

tttcaagtga ctttttataa 730 

tctgaataca aaaaagtaac 790 

ttagattgac aaatagtatc 850 
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tcaaaattga tcaggttctg gagaatctat tgaggttgta tggagtaacc 

agaggttgga agtgaataat tttatgtggt attatgtgat tttttttggt 

tctggagctg, agcatgatgg agcctagatg gggcagcctt ctaatgctcc 

caaatcatat gtcacgtcaa aattgtaatg acaaagctca cagcagctgc 

tttggtgcca gctctaatta gagcattgat catttctctc cccttcatga 

gtgcacattt tcatatataa gggcaaagga catttcaaag tttggatgga 

cttagaagta ataaaaaaca atactttgaa atcttaccaa atgttaaaca 

atttttccat ttgaagatgt gtgtcttaag gtctgttgga tctttcaaat 

gacatggaaa ctacagatga aatttaactt actcaaagtt acacagaact 

ctcttttact taattcttct aaaatctaaa gtaagaataa ttttgaaatt 

tgcttaaata acaaaagtta ctttaatatg ttacatgttt gaagtttcat 

tatctaattt aacgcttgtg aaaatattat tcagtatttt tttttctttt 

ttgttcaggg ctccttttgg gattttaagt gagaaaaact ctatttcata 

catgctcatc caataacact tcatagatgg ttagccatct ctaaaattaa 

tgaggagcat tgaaaatatc acctatattt ggatttctgt ttaacacttc 

ttatcataat ttaaaatttg atttaaatac aaacctcttt atagaaaacc 

agttttattt gtttgcttgc ttttttttgc agggagggat attcccttga 

tctttatgac tcatcggtta tttataaatg ttatcatcgt ttttctgatt 

tttcagaatt aatacattta actgaataaa tcaaccaata atgtcaaact 

<210> 7 
<211> 1992 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> 5*OTR 
<222> 1. .95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3'UTR 
<222> 168. .1992 

<220> 

<221> polyA_signal 
<222> 1966.. 1971 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 7 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct 217 
Lys * 



aagtgatgga 


910 


gggggtagtg 


970 


cagtatttcg 


1030 


ctacgccagg 


1090 


atgcacatga 


1150 


atctttccta 


1210 


aatagcaaca 


1270 


ttctattttg 


1330 


cctggctgat 


1390 


ttattggtaa 


1450 


ttatacttaa 


1510 


ctaaaatgac 


1570 


atattgttac 


1630 


attttgtagt 


1690 


aaaatgeatt 


1750 


tgataaatgt 


1810 


taccactctg 


1870 


tttaaataag 


1930 


t 


1981 
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ggacacacta agtaatagag ttaatcacac cgtcctggtt ttgtgcatct ctaagatgat 277 

catcttggtg tggggtcctg atcttcccag tgacttagca gcacagtaat ttggctgaaa 337 

tgttgatgag ttcagacaag aagaaggtga tgatgacaag agagcgtcac ggggaaattc 397 

tgccctgagg- aaggaatagt ttgaggctgg agagatgtgc tacccccgcc actctttctc -457 

aaatggaaag acaacctgta ggcacagaaa tcattgctct gcttacactg aaatctcctg 517 

ctgactttga gagctgagca ttggattttg atgggagagt gaaacttagt gaagaaatga 577 

tttgaacctt tgttgatcta atacttctgfc gatacagtta tctaaacaga atgagtctaa 637 

agtcagtata aagacctgtg tctctccatg ttttgctgct tataattgca gaggaacatt 697 

ttgagataga aaattttatc cctctttcaa gtgacttttt ataaaacata accttttaag 757 

agaggtcatt cttttgaaaa attgtctgaa tacaaaaaag taactgaaca gcaagggatt 817 

gtagccataa tgttcttgac cactttagat tgacaaatag tatctcaaaa ttgatcaggt 877 

tctggagaat ctattgaggt tgtatggagt aaccaagtga tggaagaggt tggaagtgaa 937 

taattttatg tggtattatg tgattttttt tggtgggggt agtgtctgga gctgagcatg 997 

atggagccta gatggggcag ccttctaatg ctcccagtat ttcgcaaatc atatgtcacg 1057 

tcaaaattgt aatgacaaag ctcacagcag ctgcctacgc caggtttggt gccagctcta 1117 

attagagcat tgatcatttc tctccccttc atgaatgcac atgagtgcac attttcatat 1177 

ataagggcaa aggacatttc - aaagtttgga t.ggaatcttt cctacttaga agtaataaaa 1237 

aacaatactt tgaaatctta ccaaatgtta aacaaatagc aacaattttt ccatttgaag 1297 

atgtgtgtct taaggtctgt tggatctttc aaatttctat tttggacatg gaaactacag 1357 

atgaaattta acttactcaa agttacacag aactcctggc tgatctcttt tacttaattc 1417 

ttctaaaatc taaagtaaga ataattttga aattttattg gtaatgctta aataacaaaa 1477 

gttactttaa tatgttacat gtttgaagtt tcatttatac ttaatatcta atttaacgct 1537 

tgtgaaaata ttattcagta tttttttttc ttttctaaaa tgacttgttc agggctcctt 1597 

ttgggatttt aagtgagaaa aactctattt cataatattg ttaccatgct catccaataa 1657 

cac-ttcatag atggttagcc atctctaaaa ttaaattttg tagttgagga gcattgaaaa 171? 

tatcacctat atttggattt. ctgtttaaca cfctcaaaatg cattttatca taatttaaaa 1777 

tttgatttaa atacaaacct ctttatagaa aacctgataa atgtagtttt atttgtttgc 1837 

ttgctttttt ttgcagggag ggatattccc ttgataccac tctgtcttta tgactcatcg 1897 

gttatttata aatgttatca tcgtttttct gatttttaaa taagtttcag aattaataca 195? 

tttaactgaa taaatcaacc aataatgtca aactt 1992 

<210> 8 

<211> 1898 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 



<220> 
<221> CDS 
<222> 96.. 182 



<220> 

<221> 3'UTR 
<222> 183. .1898 

<220> 

<221> polyA_signal 
<222> 1872.7l877 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 



<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
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<400> 8 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Bhe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa ggg 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

ttt tgt gca tct eta aga tga tcatcttggt gtggggtcct gatcttccca 212 
Phe Cys Ala Ser Leu Arg * 
25 

gtgacttagc agcacagtaa tttggctgaa atgttgatga gttcagacaa gaagaaggtg 272 

atgatgacaa gagagegtea eggggaaatt ctgccctgag gaaggaatag tttgaggctg 332 

gagagatgtg ctacccccgc cactctttct caaatggaaa gacaacctgt aggcacagaa 392 

ateattgetc tgcttacact gaaatctcct gctgactttg agagctgagc attggatttt 452 

gatgggagag tgaaacttag tgaagaaatg atttgaacct ttgttgatct aatacttctg 512 

tgatacagtt atctaaacag aatgagtcta aagtcagtat aaagacctgt gtctctccat 572 

gttttgctgc ttataattgc agaggaacat tttgagatag aaaattttat ccctctttca 632 

agtgactttt tataaaacat aaccttttaa gagaggtcat tcttttgaaa aattgtctga 692 

atacaaaaaa gtaactgaac agcaagggat tgtagecata atgttcttga ccactttaga 752 

ttgacaaata gtatctqaaa attgatcagg ttctggagaa tctattgagg ttgtatggag 812 
taaccaagtg atggaagagg ttggaagtga ataattttat gtggtattat gtgatttttt ' 872 

ttggtggggg tagtgtctgg agctgagcat gatggagect agatggggca gecttctaat 932 

gctcccagta tttcgcaaat catatgtcac gtcaaaattg taatgacaaa gctcacagca 992 

gctgcctacg ccaggtttgg tgccagctct aattagagca ttgatcattt ctctcccctt 1052 

catgaatgea catgagtgea cattttcata tataagggca aaggacattt caaagtttgg 1112 

atggaatctt tcctacttag aagtaataaa aaacaatact ttgaaatctt accaaatgtt 1172 

aaacaaatag caacaatttt tccatttgaa gatgtgtgtc ttaaggtctg ttggatcttt 1232 

caaatttcta ttttggacat ggaaactaca gatgaaattt aacttactca aagttacaca 1292 

gaactcctgg ctgatctctt ttacttaatt cttctaaaat ctaaagtaag aataattttg 1352 

aaattttatt ggtaatgett aaataacaaa agttacttta atatgttaca tgtttgaagt 1412 

ttcatttata cttaatatct aatttaaege ttgtgaaaat attattcagt attttttttt 1472 

cttttctaaa atgacttgtt cagggctcct tttgggattt taagtgagaa aaactctatt 1532 

tcataatatt gttaccatgc tcatccaata acacttcata gatggttagc catctctaaa 1592 

attaaatttt -gtagttgagg agcattgaaa atatcaccta tatttggatt tctgtttaac 1652 

acttcaaaat gcattttatc ataatttaaa atttgattta aatacaaacc tctttataga 1712 

aaacctgata aatgtagttt tatttgtttg ettgettttt tttgcaggga gggatattcc 1772 

cttgatacea ctctgtcttt atgactcatc ggttatttat aaatgttatc . ategttttte 1832 

tgatttttaa ataagtttca gaattaatac atttaactga ataaatcaac caataatgtc 18 92 

aaactt 1898 



<210> 9 

<211> 1783 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 e UTR 
<222> 1..95 

<220> 
<221> CDS 
<222> 96. .167 

<220> 

<221> 3 'UTR 
<222> 168.. 1783 



<220> 

<221> polyA_signal 
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<222> 1757.. 1762 



<220> 

<221> allele 
<222> 132 
<223> 8-130-143 



polymorphic base A or G 



<220> 

<221> allele 
<222> 132 
<223> 8-126-286 



polymorphic base A or G 



<400> 9 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecaget cagatgttac ttccaaccaa caaatttcct * 217 
Lys * ' 

ggacacacta agtaatagag ttaatcacac cgtcctgaaa gacaacctgt aggcacagaa 277 

ateattgetc tgcttacact gaaatctcct gctgactttg agagctgagc attggatttt 337 

gatgggagag tgaaacttag tgaagaaatg atttgaacct ttgttgatct aatacttctg 397 

tgatacagtt atctaaacag aatgagtcta aagtcagtat aaagacctgt gtctctccat 457 

gttttgctgc ttataattgc agaggaacat tttgagatag aaaattttat ccctctttca 517 

agtgactttt tataaaacat aaccttttaa gagaggtcat tcttttgaaa aattgtctga 577 

atacaaaaaa gtaactgaac agcaagggat tgtagecata atgttcttga ccactttaga 637 

ttgacaaata gtatctcaaa attgatcagg ttctggagaa tctattgagg ttgtatggag 697 

taaccaagtg atggaagagg ttggaagtga ataattttat gtggtattat gtgatttttt 757 

ttggtggggg tagtgtctgg agctgagcat gatggagect agatggggca gecttctaat 817 

gctcccagta tttegcaaat. catatgtcac gtcaaaattg taatgacaaa gctcacagca 877 

gctgcctacg ccaggtttgg tgccagctct aattagagca ttgatcattt ctctcccctt 937 

catgaatgea catgagtgea cattttcata tataagggca aaggacattt caaagtttgg 997 

atggaatctt tcctacttag aagtaataaa aaacaatact ttgaaatctt accaaatgtt 1057 

aaacaaatag caacaatttt tccatttgaa gatgtgtgtc ttaaggtctg ttggatcttt 1117 

caaatttcta ttttggacat ggaaactaca gatgaaattt aacttactca aagttacaca 1177 

gaactcctgg ctgatctctt ttacttaatt cttctaaaat ctaaagtaag aataattttg 1237 

aaattttatt ggtaatgett aaataacaaa agttacttta atatgttaca tgtttgaagt 1297 

ttcatttata cttaatatct aatttaaege ttgtgaaaat attattcagt attttttttt 1357 

cttttctaaa atgacttgtt cagggctcct tttgggattt taagtgagaa aaactctatt 1417 

tcataatatt gttaccatgc tcatccaata acacttcata gatggttagc catctctaaa 1477 

attaaatttt gtagttgagg agcattgaaa atatcaccta tatttggatt tctgtttaac 1537 

acttcaaaat gcattttatc ataatttaaa atttgattta aatacaaacc tctttataga 1597 

aaacctgata aatgtagttt tatttgtttg ettgettttt tttgcaggga gggatattcc 1657 

cttgatacca ctctgtcttt atgactcatc ggttatttat aaatgttatc ategttttte 1717 

tgatttttaa ataagtttca gaattaatac atttaactga ataaatcaac caataatgtc 1777 

aaactt 1783 

<210> 10 
<211> 2309 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> 5'UTR 
<222> 1..95 



<220> 
<221> CDS 
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<222> 96. .311 
<220> 

<221> 3'UTR 
<222> 312.. 2309 

<220> 

<221> polyA__signal 
<222> 2283. .2288 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 10 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 



aag 


cct 


gta 


agg aga 


tec 


rgt 


etc agt 


tgg cca 


ggt 


ctt 


gtg 


aaa 


ggt 


161 


Lys 


Pro 


Val 


Arg Arg 


Ser 


Xaa 


Leu Ser 


Trp Pro 


Gly Leu 


Val 


Lys 


Gly 










10 






15 








20 








aac 


etc 


ate 


acc gga 


gtc 


aag 


tgt cat 


cac ctt 


tec 


cat 


gtt 


cca 


ttg 


209 


Asn 


Leu 


lie 


Thr Gly 


Val 


Lys 


Cys His 


His Leu 


Ser 


His 


Val 


Pro 


Leu 








25 








30 






35 










gtt 


aga 


agfc 


ccc age 


tec 


tgc 


cca cat 


tea agg 


gaa 


aat 


aac 


gca 


ttt 


257 


Val 


Arg 


Ser 


Pro Ser 


Ser 


Cys 


Pro His 


Ser Arg 


Glu 


Asn 


Asn 


Ala 


Phe 






40. 








45 






50 












ata 


aaa 


gec 


age tea 


gat 


gtt 


act tec 


aac caa 


caa 


att 


tec 


tgg 


aca 


305 


lie 


Lys 


Ala 


Ser Ser 


Asp 


Val 


Thr Ser 


Asn Gin 


Gin 


lie 


Ser 


Trp 


Thr 




55 








60 






65 










70 




cac 


taa 


gtaatagagt taatcacacc gtcctgaaca ttcaccctgg 


gagctctgtg 


3 61 



His * 



ctgccatgca aatgtcacat 
tacctgaaaa gctggagaga 
agctaaatca teggcattgt 
ctggaccaac agaatcatca 
gtgtggggtc ctgatcttcc 
gagttcagac aagaagaagg 
aggaaggaat agtttgaggc 
aagacaacct gtaggcacag 
tgagagctga gcattggatt 
ctttgttgat ctaatacttc 
ataaagacct gtgtctctcc 
agaaaatttt atccctcttt 
attcttttga aaaattgtct 
taatgttctt gaccacttta 
aatctattga ggttgtatgg 
atgtggtatt atgtgatttt 
ctagatgggg cagccttcta 
tgtaatgaca aagctcacag 
cattgatcat ttctctcccc 
caaaggacat ttcaaagttt 
ctttgaaatc ttaccaaatg 
tcttaaggtc tgttggatct 



tgcactgagg ccgccatgct gtaaaaaacc ccaagctacc 4 21 

atcagcaccc atcagctttt ccagcatttt gggtcctccc 481 

gaagcagaga tgagacctcc tgccatgccc tttccaaatc 541 

tatacactag gttttgtgca tctctaagat gateatcttg 601 

cagtgactta gcagcacagt aatttggctg aaatgttgat 661 

tgatgatgac aagagagcgt caeggggaaa ttctgccctg 721 

tggagagatg tgctaccccc gccactcttt ctcaaatgga 7 81 

aaatcattgc tetgettaca ctgaaatctc ctgetgaett 8 41 

ttgatgggag agtgaaactt agtgaagaaa tgatttgaac 901 

tgtgatacag ttatctaaac agaatgagtc taaagtcagt 961 

atgttttgct gcttataatt gcagaggaac attttgagat 1021 

caagtgactt tttataaaac ataacctttt aagagaggtc 1081 

gaatacaaaa aagtaactga acagcaaggg attgtageca 1141 

gattgacaaa tagtatctca aaattgatca ggttctggag 1201 

agtaaccaag tgatggaaga ggttggaagt gaataatttt 12 61 

ttttggtggg ggtagtgtct ggagctgagc atgatggagc 1321 

atgctcccag tatttegcaa atcatatgtc aegtcaaaat 1381 

cagctgccta cgccaggttt ggtgccagct ctaattagag 1441 

ttcatgaatg cacatgagtg cacattttca tatataaggg 1501 

ggatggaatc tttcctactt agaagtaata aaaaacaata 1561 

ttaaacaaat agcaacaatt tttccatttg aagatgtgtg 1621 

ttcaaatttc tattttggac atggaaacta cagatgaaat 1681 
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ttaacttact 


caaagttaca 


atctaaagta 


agaataattt 


taatatgtta 


catgtttgaa 


atattattca 


gtattttttt 


tttaagtgag 


aaaaactcta 


tagatggtta 


gccatctcta 


tatatttgga 


tttctgttta 


taaatacaaa 


cctctttata 


tttttgcagg 


gagggatatt 


ataaatgtta 


tcatcgtttt 


gaataaatca 


accaataatg 


<210> 


11 




<211> 


2075 




<212> 


DNA . 




<213> 


Homo 


sapiens 


<220> 






<221> 


5 f UTR 


<222> 


1..9.5 


<220> 






<221> 


CDS 




<222> 


96. .311 


<220> 






<221> 


3 1 UTR 


<222> 


312. . 


.2075 



cagaactcct 
tgaaatttta 
gtttcattta 
ttcttttcta 
tttcataata 
aaattaaatt 
acacttcaaa 
gaaaacctga 
cccttgatac 
tctgattttt 
tcaaactt 



ggctgatctc 
ttggtaatgc 
tacttaatat 
aaatgacttg 
ttgttaccat 
ttgtagttga 
atgcatttta 
taaatgtagt 
cactctgtct 
aaataagttt 



ttttaettaa 
ttaaataaca 
ctaatttaac 
ttcagggctc 
gctcatccaa 
ggagcattga 
tcataattta 
tttatttgtt 
ttatgactca 
cagaattaat 



ttcttctaaa 
aaagttactt 
gcttgtgaaa 
cttttgggat 
taacacttca 
aaatatcacc 
aaatttgatt 
tgcttgcttt 
tcggttattt 
acatttaacfc 



1741 
1801 
1861 
1921 
1961 
2041 
2101 
2161 
2221 
2281 
2309 



<220> 

<221> polyA_signal 
<222> 2049. .2054 



<220> 

<221> allele 
<222> 132 

<223> 8-130-14 3 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 



<400> 11 

ttgcaggtcc acattcgttc tiagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 



aag 


cct 


gta 


agg 


aga 


tec 


rgt 


etc 


agt 


tgg cca 


ggt 


ctt 


gtg 


aaa 


ggt 


161 


kys 


Pro 


Val 


Arg 


Arg 


Ser 


Xaa 


Leu 


Ser 


Trp Pro 


Gly 


Leu 


Val 


Lys 


Gly 










10 










15 








20 








aac 


etc 


ate 


ace 


gga 


gtc 


aag 


tgt 


cat 


cac ctt 


tec 


cat 


gtt 


cca 


ttg 


209 


Asn 


Leu 


He 


Thr 


Gly 


Val 


Lys 


Cys 


His 


His Leu 


Ser 


His 


Val 


Pro 


Leu 








25 










30 








35 










gtt 


aga 


agt 


ccc 


age 


tec 


tgc 


cca 


cat 


tea agg 


gaa 


aat 


aac 


gca 


ttt 


257 


Val 


Arg 


Ser 


Pro 


Ser 


Ser 


Cys 


Pro 


His 


Ser Arg 


Glu 


Asn 


Asn 


Ala 


Phe 






40 










45 








50 












ata 


aaa 


gcc 


age 


tea 


gat 


gtt 


act 


tec 


aac caa 


caa 


att 


tec 


tgg 


aca 


305 


lie 


Lys 


Ala 


Ser 


Ser 


Asp 


Val 


Thr 


Ser 


Asn Gin 


Gin 


He 


Ser 


Trp 


Thr 




55 










60 








65 










70 




cac 


taa 


gtaatagagt taatcacace gtcctggttt tgtgcatctc 


taagatgatc 


361 
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His * 

atcttggtgt ggggtcctga tcttcccagt gacttagcag cacagtaatt tggctgaaat 421 

gttgatgagt tcagacaaga agaaggtgat gatgacaaga gagcgtcacg gggaaattct 481 

gccctgagga aggaatagtt tgaggctgga gagatgtgct acccccgcca ctctttctca 541 

aatggaaaga caacctgtag gcacagaaat cattgctctg cttacactga aatctcctgc 601 

tgactttgag agctgagcat tggattttga tgggagagtg aaacttagtg aagaaatgat 661 

ttgaaccttt gttgatctaa tacttctgtg atacagttat ctaaacagaa tgagtctaaa 721 

gtcagtataa agacctgtgt ctctccatgt tttgctgctt ataattgcag aggaacattt 781 

tgagatagaa aattttatcc ctctttcaag tgacttttta taaaacataa ccttttaaga 841 

gaggtcattc ttttgaaaaa ttgtctgaat acaaaaaagt aactgaacag caagggattg 901 

tagccataat gttcttgacc actttagatt gacaaatagt atctcaaaat tgatcaggtt 961 

ctggagaatc tattgaggtt gtatggagta accaagtgat ggaagaggtt ggaagtgaat 1021 

aattttatgt ggtattatgt- gatttttttt ggtgggggta gtgtctggag ctgagcatga 1081 

tggagcctag atggggcagc cttctaatgc tcccagtatt tcgcaaatca tatgtcacgt 1141 

caaaattgta atgacaaagc tcacagcagc tgcctacgcc aggtttggtg ccagctctaa 1201 

ttagagcatt gatcatttct ctccccttca tgaatgcaca tgagtgcaca ttttcatata 1261 

taagggcaaa ggacatttca aagtttggat ggaatctttc ctacttagaa gtaataaaaa 1321 

acaatacttt gaaatcttac caaatgttaa acaaatagca acaatttttc catttgaaga 1381 

tgtgtgtctt aaggtctgtt ggatctttca aatttctatt ttggacatgg aaactacaga 14 41 

tgaaatttaa cttactcaaa gttacacaga actcctggct gatctctttt acttaattct. 1501 

tctaaaatct aaagtaagaa taattttgaa attttattgg taatgcttaa ataacaaaag 1561 

ttactttaat atgttacatg tttgaagttt catttatact taatatctaa tttaacgctt 1621 

gtgaaaatat tattcagtat ttttttttct tttctaaaat gacttgttca gggctccttt 1681 

tgggatttta agtgagaaaa actctatttc ataatattgt taccatgctc atccaataac 1741 

acttcataga tggttagcca tctctaaaat taaattttgt agttgaggag cattgaaaafc 1801 

atcacctata tttggatttc tgtttaacac ttcaaaat'gc attttatcat aatttaaaat 1861 

ttgatttaaa tacaaacctc tttatagaaa acctgataaa tgtagtttta tttgtttgct. 1921 

tgcttttttt tgcagggagg gatattccct tgataccact ctgtctttat gactcatcgg 1981 

ttatttataa atgttatcat cgtttttctg atttttaaat aagtttcaga attaatacat 2041 

ttaactgaat aaatcaacca ataatgtcaa actt 2075 

<210> 12 

<211> 1923 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> 5'UTR 
<222> 1. .95 

<2Z0> 
<221> CDS 
<222> 96.. 212 



<220> 

<221> 3'UTR 
<222> 213.. 1923 

<220> 

<221> polyA_signal 
<222> 1897. .1902 



<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 ; polymorphic base A or G 
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<400> 12 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 

Met Ala Ala Tyr Phe Phe 
1 5 
aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aca ttc acc ctg gga get ctg tgc tgc cat gca aat gtc aca ttg cac 
Thr Phe Thr Leu Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 



25 



30 



35 



tga ggccgccatg ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga 



60 
113 



161 



209 



262 



gaatcagcac 
gtgaagcaga 
catatacact 
ctcctgctga 
aaatgatttg 
gtctaaagtc 
aacattttga 
tttaagagag 
gggattgtag 
tcaggttctg 
agtgaataat 
agcatgatgg 
gtcacgtcaa 
gctctaatta 
tcatatataa 
ataaaaaaca 
ttgaagatgt 
ctacagatga 
taattcttct 
acaaaagtta 
aacgcttgtg 
ctccttttgg 
caataacact 
tgaaaatatc 
ttaaaatttg 
gtttgcttgc 
teateggtta 
aatacattta 



ccatcagctt 
gat gaga cct 
agaaagacaa 
ctttgagagc 
aacctttgtt 
agtataaaga 
gatagaaaat 
gtcattcttt 
ccataatgtt 
gagaatctat 
tttatgtggt 
agectagatg 
aattgtaatg 
gagcattgat 
gggcaaagga 
atactttgaa 
gtgtcttaag 
aatttaactt 
aaaatctaaa 
ctttaatatg 
aaaatattat 
gattttaagt 
tcatagatgg 
acctatattt 
atttaaatac 
ttttttttgc 
tttataaatg 
actgaataaa 



ttccagcatt 
cctgccatgc 
cctgtaggca 
tgagcattgg 
gatctaatac 
cctgtgtctc 
tttatccctc 
tgaaaaattg 
cttgaccact 
tgaggttgta 
attatgtgat 
gggcagcett 
acaaagctca 
catttctctc 
catttcaaag 
atcttaccaa 
gtctgttgga 
actcaaagtt 
gtaagaataa 
ttacatgttt 
tcagtatttt 
gagaaaaact 
ttagecatet 
ggatttctgt 
aaacctcttt 
agggagggat 
ttatcatcgt 
tcaaccaata 



ttgggtcctc 
cctttccaaa 
cagaaatcat 
attttgatgg 
ttctgtgata 
tccatgtttt 
tttcaagtga 
tctgaataca 
ttagattgac 
tggagtaacc 
tttttttggf 
ctaatgctcc 
cagcagctgc 
cccttcatga 
tttggatgga 
atgttaaaca 
tctttcaaat 
acacagaact 
ttttgaaatt 
gaagtttcat 
tttttctttt 
ctatttcata 
ctaaaattaa 
ttaacacttc 
atagaaaacc 
attcccttga 
ttttctgatt 
atgtcaaact 



ccagctaaat 
tcctggacca 
tgetctgett 
gagagtgaaa 
cagttatcta 
getgettata 
ctttttataa 
aaaaagtaac 
aaatagtatc 
aagtgatgga 
gggggtagtg 
cagtatttcg 
ctacgccagg 
atgcacatga 
atctttccta 
aatagcaaca 
ttctattttg 
cctggctgat 
ttattggtaa 
ttatacttaa 
ctaaaatgac 
atattgttac 
attttgtagt 
aaaatgeatt 
tgataaatgt 
taccactctg 
tttaaataag 
t 



categgcatt 
acagaatcat 
acactgaaat 
cttagtgaag 
aacagaatga 
attgeagagg 
aacataacct 
tgaacagcaa 
tcaaaattga 
agaggttgga 
tctggagctg 
caaatcatat 
tttggtgcca 
gtgcacattt 
cttagaagta 
atttttccat 
gacatggaaa 
ctcttttact 
tgcttaaata 
tatctaattt 
ttgttcaggg 
catgctcatc 
tgaggagcat 
ttatcataat 
agttttattt 
tctttatgac 
tttcagaatt 



322 
382 
442 
502 
562 
622 
682 
742 
802 
862 
922 
982 
1042 
1102 
1162 
1222 
1282 
1342 
1402 
1462 
1522 
1582 
1642 
1702 
1762 
1822 
1882 
1923 



<210> 13 

<211> 2308 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5'UTR 
<222> 1..95 



<220> 
<221> CDS 
<222> 96., 167 

<220> 

<221> 3'UTR 
<222> 168. .2308 



<220> 
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<221> polyA_$ignal 
<222> 2282. .2287 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 

<220> - 
<221> allele 
<222> 132 

<223> 8-126-286 ; polymorphic base A or G 
<400> 13 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagccagct cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctaaac aatacgaaca gaaacaaagc 277 

tatgtacttg ttttgggaag caacctcaaa tggaaagatg ggggctgtgt tctaaaagga 337 

acattcaccc tgggagctct gtgetgecat gcaaatgtca cattgeactg aggccgccat 397 

getgtaaaaa accccaagct acctacctga aaagctggag agaatcagca cccatcagct 4 57 

tttccagcat tttgggtcct cccagctaaa teateggcat tgtgaagcag agatgagacc. 517 

tcctgccatg ccctttccaa atcctggacc aacagaatca tcatatacac taggttttgt 577 

gcatctctaa gatgatcatc ttggtgtggg gtcctgatct tcccagtgac ttagcagcac 637 

agtaatttgg ctgaaatgtt gatgagttca gacaagaaga aggtgatgat gacaagagag 697 

cgtcacgggg aaattctgee ctgaggaagg aatagtttga ggctggagag atgtgctacc 757 

cccgccactc tttctcaaat ggaaagacaa cctgt,aggca cagaaatcat tgetctgett 817 

acactgaaat ctcctgctga ctttgagagc tgagcattgg attttgatgg gagagtgaaa 877 

cttagtgaag aaatgatttg aacctttgtt gatctaatac ttctgtgata cagttatcta 937 

aacagaatga gtctaaagtc agtataaaga cctgtgtctc tccatgtttt getgettata 997 

attgeagagg aacattttga gatagaaaat tttatccctc tttcaagtga ctttttataa 1057 

aacataacct tttaagagag gtcattcttt tgaaaaattg tctgaataca aaaaagtaac 1117 

tgaacagcaa gggattgtag ccataatgtt cttgaccact ttagattgac aaatagtatc 1177. 

tcaaaattga tcaggttetg gagaatctat tgaggttgta tggagtaacc aagtgatgga 1237 

agaggttgga agtgaataat tttatgtggt attatgtgat tttttttggt gggggtagtg 1297 

tctggagctg agcatgatgg agectagatg gggcagcett ctaatgctcc cagtatttcg 1357 

caaatcatat gtcacgtcaa aattgtaatg acaaagctca cagcagctgc ctacgccagg 1417 

tttggtgcca gctctaatta gagcattgat catttctctc cccttcatga atgcacatga 1477 

gtgcacattt tcatatataa gggcaaagga catttcaaag tttggatgga atctttccta 1537 

cttagaagta ataaaaaaca atactttgaa atcttaccaa atgttaaaca aatagcaaca 1597 

atttttccat ttgaagatgt gtgtcttaag gtctgttgga tctttcaaat ttctattttg 1657 

gacatggaaa ctacagatga aatttaactt actcaaagtt acacagaact cctggctgat 1717 

ctcttttact taattcttct aaaatctaaa gtaagaataa ttttgaaatt ttattggtaa 1777 

tgcttaaata acaaaagtta ctttaatatg ttacatgttt gaagtttcat ttatacttaa 1837 

tatctaattt aacgcttgtg aaaatattat tcagtatttt tttttctttt ctaaaatgac 18 97 

ttgttcaggg ctccttttgg gattttaagt gagaaaaact ctatttcata atattgttac 1957 

catgctcatc caataacact tcatagatgg ttagecatet ctaaaattaa attttgtagt 2017 

tgaggagcat tgaaaatatc acctatattt ggatttctgt ttaacacttc aaaatgeatt 2077 

ttatcataat ttaaaatttg atttaaatac aaacctcttt atagaaaacc tgataaatgt 2137 

agttttattt gtttgcttgc ttttttttgc agggagggat attcccttga taccactctg 2197 

tctttatgac teateggtta tttataaatg ttatcatcgt ttttctgatt tttaaataag 2257 

tttcagaatt aatacattta actgaataaa tcaaccaata atgtcaaact t 2308 

<210> 14 
<211> 2706 
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<212> DNA 

<213> Homo sapiens 

<220> 

<22.1> 5'UTR 
<222> 1. ,95 

<220> 
<221> CDS 

<222> 96. .167 • 
<220> 

<221> 3*UTR 
<222> 160. .2706 

<220> 

<221> polyA^signal 
<222> 2680.. 2685 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-286 : polymorphic base A or G 
<400> 14 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 

aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aaa taa cgcatttata aaagecagct cagatgttac ttccaaccaa caaatttcct 217 
Lys * 

ggacacacta agtaatagag ttaatcacac cgtcctgaac attcaccctg ggagctctgt 277 

gctgccatgc aaatgtcaca ttgeactgag gccgccatgc tgtaaaaaac cccaagctac 337 

ctacctgaaa agctggagag aatcagcacc catcagcttt tccagcattt tgggtcctce 397 

cagctaaatc ateggcattg tgaagcagag atgagacctc ctgccatgcc ctttccaaat 4 57 

cetggaccaa cagaatcatc atatacacta gtaagctttt actttaagee actaatttgg 517 

ggtgtagctc ttttgeagea aaagatcatc agaacaetgc catgeaggag ctgataaact 577 

tgctcactgt tatgtatgta tccaatgact actcacacac acattacttg cttcactcta 637 

ctttctggct ttccattcag caaattttaa gaaatgaaaa tgttattcgt gggaaattaa 697 

ctcaagatca atattgttgt agttaatctc tttgtaaatg gtaaaatatt aaattattgt 757 

gagtcatttt tatttaaaga agtttggttc fcttaaatgaa agaagtttgg gtcttttgtg 817 

gatactctgc atatagacat tctagacatg aaagaatgtt gaagacaaag aatgttgaag 877 

aaatatagac atgaaagact gttgaagttt ggttttttgt ggatactctg catatagaca 937 

ttctagactg gagtgataca tcttgttcag ggttttgtgc atctctaaga tgatcatctt 997 

ggtgtggggt cctgatcttc ccagtgactt agcagcacag taatttggct gaaatgttga 1057 

tgagttcaga caagaagaag gtgatgatga caagagagcg teaeggggaa attctgccct 1117 

gaggaaggaa tagtttgagg ctggagagat gtgctacccc cgccactctt tctcaaatgg 1177 

aaagacaacc tgtaggcaca gaaatcattg ctctgcttac actgaaatct cctgctgact 1237 

ttgagagctg agcattggat tttgatggga gagtgaaact tagtgaagaa atgatttgaa 1297 

cetttgttga tctaatactt ctgtgataca gttatctaaa cagaatgagt ctaaagtcag 1357 

tataaagacc tgtgtctctc catgttttgc tgcttataat tgcagaggaa cattttgaga 1417 

tagaaaattt tatccctctt tcaagtgact ttttataaaa cataaccttt taagagaggt 14 77 

cattcttttg aaaaattgtc tgaatacaaa aaagtaactg aacagcaagg gattgtagee 1537 
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ataatgttct tgaccacttt agattgacaa atagtatctc aaaattgatc aggttctgga 1597 

gaatctattg aggttgtatg gagtaaccaa gtgatggaag .aggttggaag tgaataattt 1657 

tatgtggtat tatgtgattt tttttggtgg gggtagtgtc tggagctgag catgatggag 1717 

cctagatggg gcagccttct aatgctccca gtatttcgca aatcatatgt cacgtcaaaa 1777 

ttgtaatgac aaagctcaca gcagctgcct acgccaggtt tggtgccagc tctaattaga 1837 

gcattgatca tttctctccc cttcatgaat gcacatgagt gcacattttc atatataagg 1897 

gcaaaggaca tttcaaagtt tggatggaat ctttcctact tagaagtaat aaaaaacaat 1957 

actttgaaat cttaccaaat gttaaacaaa tagcaacaat ttttccattt gaagatgtgt 2017 

gtcttaaggt ctgttggatc tttcaaattt ctattttgga catggaaact acagatgaaa 2077 

tttaacttac tcaaagttac acagaactcc tggctgatct cttttactta .attcttctaa 2137 

aatctaaagt aagaataatt ttgaaatttt attggtaatg cttaaataac aaaagttact 2197 

ttaatatgtt acatgtttga agtttcattt atacttaata tctaatttaa cgcttgtgaa 2257 

aatattattc agtatttttt tttcttttct aaaatgactt gttcagggct ccttttggga 2317 

ttttaagtga gaaaaactct atttcataat attgttacca tgctcatcca ataacacttc 2377 

atagatggtt. agccatctct aaaattaaat tttgtagttg aggagcattg aaaatatcac 24 37 

ctatatttgg atttctgttt aacacttcaa aatgcatttt atcataattt aaaatttgat 24 97 

ttaaatacaa acctcttta't agaaaacctg ataaatgtag ttttatttgt ttgcttgctt 2557 

ttttttgcag ggagggatat tcccttgata ccactctgtc tttatgactc atcggttatt: 2617 

tataaatgtt atcatcgttt ttctgatttt taaataagtt tcagaattaa tacatttaac 2677 

tgaataaatc aaccaataat gtcaaactt 2706 

<210> 15 

<211> 2612 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 1 UTR 
<222> 1. .95 

<220> 

<221> CDS • 
<222> 96.. 212 

<220> 

<221> 3* UTR 
<222> 213. .2612 

<220> 

<221> polyA_signal 
<222> 2586.. 2591 

<220> 

<221> allele 
<222> 132 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 132 

<223> 8-126-28 6 : polymorphic base A or G 
<4O0> 15 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggcctc ataggcaagg 60 
aactcatagt ttcttgccat gaagactttt ccaac atg get get tac ttc ttc 113 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ct:t gtg aaa gga 161 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys Gly 

10 15 20 

aca ttc acc ctg gga get ctg tgc tgc cat gca aat gtc aca ttg cac 209 
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Thr Phe Thr Leu Gly Ala Leu Cys Cys His Ala Asn Val Thr Leu His 

25 30 35 

tga ggccgccatg ctgtaaaaaa ccccaagcta cctacctgaa aagctggaga 262 
* 

gaatcagcac ccatcagctt ttccagcatt ttgggtcctc ccagctaaat catcggcatt 322 

gtgaagcaga gatgagacct cctgccatgc cctttccaaa tcctggacca acagaatcat 382 

catatacact agtaagcttt tactt/taagc cactaatttg gggtgtagct cttttgcagc 442 

aaaagatcat cagaacactg ccatgcagga gctgataaac ttgctcactg ttatgtatgt 502 

atccaatgac tactcacaca cacattactt gcttcactct actttctggc tttccattca 562 

gcaaatttta agaaatgaaa atgttattcg tgggaaatta actcaagatc aatattgttg 622 

tagttaatct ctttgtaaat ggtaaaatat taaattattg tgagtcattt ttatttaaag 682 

aagtttggtt ctttaaatga aagaagtttg ggtcttttgt ggatactctg catatagaca 74 2 

ttctagacat gaaagaatgt. tgaagacaaa gaatgttgaa gaaatataga catgaaagac 802 

tgttgaagtt tggttttttg tggatactct gcatatagac attctagact ggagtgatac 862 

atcttgttca gggttttgtg catctctaag atgatcatct tggtgtgggg tcctrgatctt 922 

cccagtgact tagcagcaca gtaatttggc tgaaatgttg atgagttcag acaagaagaa 982 

ggtgatgatg acaagagagc gtcacgggga aattctgccc tgaggaagga atagtttgag 1042 

gctggagaga tgtgctaccc ccgccactct ttctcaaatg gaaagacaac ctgtaggcac 1102 

agaaatcatt gctctgctta cactgaaatc tcctgctgac tttgagagct gagcattgga 1162 

ttttgatggg agagtgaaac ttagtgaaga aatgatttga acctttgttg atctaatact 1222 

tctgtgataa agttatctaa acagaatgag tctaaagtca gtataaagac ctgtgtctct 1282 

ccatgttttg ctgcttataa ttgcagagga acattttgag atagaaaatt ttatccctct 1342 

ttcaagtgac tttttataaa acataacctt ttaagagagg tcattctttt gaaaaattgt 1402 

ctgaatacaa aaaagtaact gaacagcaag ggattgtagc cataatgttc ttgaccactt 1462 

tagattgaca aatagtatct caaaattgat caggttctgg agaatctatt gaggttgtat 1522 

ggagtaacca agtgatggaa gaggttggaa gtgaataatt ttatgtggta ttatgtgatt 1582 

ttttttggtg ggggtagtgt ctggagctga gcatgatgga gcctagatgg ggcagccttc 1642 

taatgctccc agtatttcgc aaatcatatg tcacgtcaaa attgtaatga caaagctcac 1702 

agcagctgcc tacgccaggt ttggtgccag ctctaattag agcattgatc atttctctcc 1762 

ccttcatgaa tgcacatgag tgcacatttt catatataag ggcaaaggac atttcaaagt 1822 

ttggatggaa tctttcctac ttagaagtaa taaaaaacaa tactttgaaa tcttaccaaa 1882 

tgttaaacaa atagcaacaa tttttccatt tgaagatgtg tgtcttaagg tctgttggat 1942 

ctttcaaatt tctattttgg acatggaaac tacagatgaa atttaactta ctcaaagtta 2002 

cacagaactc ctggctgatc tcttttactt aattcttcta aaatctaaag taagaataat 2062 

tttgaaattt tattggtaat gcttaaataa caaaagttac tttaatatgt tacatgtttg. 2122 

aagtttcatt tatacttaat atctaattta acgcttgtga aaatattatt cagtattttt 2182 

ttttcttttc taaaatgact tgttcagggc tccttttggg attttaagtg agaaaaactc 2242 

tatttcataa tattgttacc atgctcatcc aataacactt catagatggt tagccatctc 2302 

taaaattaaa ttttgtagtt gaggagcatt gaaaatatca cctatatttg gatttctgtt 2362 

taacacttca aaatgcattt tatcataatt taaaatttga tttaaataca aacctcttta 24 22 

tagaaaacct gataaatgta gttttatttg tttgcttgct tttttttgca gggagggata 24 82 

ttcccttgat accactctgt ctttatgact catcggttat ttataaatgt tatcatcgtt 2542 

tttctgattt ttaaataagt ttcagaatta atacatttaa ctgaataaat caaccaataa 2602 

tgtcaaactt 2612 

<210> 16 
<211> 434 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> 5*UTR 
<222> 1..370 

<220> 
<221> CDS 
<222> 371. .436 

<220> 

<221> 3 1 UTR 
<222> 437. .434 
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<220> 

<221> allele 
<222> 407 
<223> 8-130-143 



polymorphic base A or G 



<220> 

<221> allele 
<222> 407 
<223> 8-126-286 



: polymorphic base A or G 



<400> 16 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 

aacgccgagc tggaaccaat ctcactgttt ctgtacctcg cttctaattc cttttcagaa 

aacaaaaact cctaagagaa tgaatagaaa ttttcttaag gaattataaa gaaccatctc 

agaattttac tctgttactg caaaatctca tgcaaagata accaagtacg aagaattaac 

tgaaattcat cccaagatga ggcatcttgg ctgagttgca ggtccacatt cgttctagag 

gctgcctgcg gtccttagag gcctcatagg caaggaactc atagtttctt gccatgaaga 
cttttccaac atg get get tac ttc ttc aag cct gta agg aga tec rgt 
Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa 
1 5 10 
etc agt tgg cca ggt ctt gtg aaa g 
Leu Ser Trp Pro Gly Leu Val Lys 
15 20 



60 
120 
180 
240 
300 
360 
409 



434 



<210> 17 

<211> 7 60 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> 5 * UTR 
<222> 1..696 



<220> 
<221> CDS 
<222> 697. 



.762 



<220> 

<221> 3 T 0TR 
<222> 763. .760 



<220> 

<221> allele 
<222> 733 

<223> 8-130-143 : polymorphic base A or G 
<220> 

<221> allele 
<222> 733 

<223> 8-126-286 : polymorphic base A or G 



<400> 17 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 

aacgccgagc tggaaccaat ctcactgttt ctgtacctcg; cttctaattc ctgtacgtca 120 

ctttaccttt tttgtctata aatttattct gaccacaagg cacccctgga gcctctgtga 180 

gtctgctgtg gttctgggga ctgeeggatt cgcgaatcgt tcatttctca gttaaactcc 240 

tttaaattaa ttcagctgaa gtttttcttt tatcaatgaa aagactagac tggcttagee 300 

ttccagcctg cacctttctc ccatgctgga tgcttcctgc ccttgaatat cagactccaa 360 

gttcttcagt tttgggactc agactggctt ccttgctcct cagcttgeag atgecctatt 420 

gtggggtctt gtgttcgttt tcagaaaaca aaaactccta agagaatgaa tagaaatttt 4 8o 
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cttaaggaat tataaagaac catctcagaa ttttactctg ttactgcaaa atctcatgca 540 
aagataacca agtacgaaga attaactgaa attcatccca agatgaggca tcttggctga 600 
gttgcaggtc cacattcgtt ctagaggctg cctgcggtcc ttagaggcct cataggcaag 660 
gaactcatag tttcttgcca tgaagacttt tccaac atg get get tac ttc ttc 714 

Met Ala Ala Tyr Phe Phe 
1 5 

aag cct gta agg aga tec rgt etc agt tgg cca ggt ctt gtg aaa g 760 
Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp Pro Gly Leu Val Lys 
10 15 20 

<210> 18 
<211> 23 
<212> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 
<222> 13 

<223> Xaa=*Gly or Ser 
<400> 18 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 
1 5 10 15 

Pro Gly Leu Val Lys Gly Lys 
20 



<210> 19 
<2U> 71 
<212> PRT 

<213> Homo sapiens 
<220> 

<221> VARIANT 
<222> 13 

<223> Xaa-Gly or Ser 



<400> 19 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 

1 5 10 15 

Pro Gly Leu Val Lys Gly Asn Leu He Thr Gly Val Lys Cys His His 

20 25 30 

Leu Ser His Val Pro Leu Val Arg Ser Pro Ser Ser Cys Pro His Ser 

35 40 45 

Arg Glu His Ser Pro Trp Glu Leu Cys Ala Ala Met Gin Met Ser His 

50 55 60 

Cys Thr Glu Ala Ala Met Leu 
65 70 

<210> 20 
<211> 38 
<2l2> PRT 

<213> Homo sapiens 



<220> 

<221> VARIANT 
<222> 13 

<223> Xaa^Gly or Ser 



<400> 20 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 
15 10 15 
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Pro Gly Leu Val Lys Gly Thr Phe. Thr Leu Gly Ala Leu Cys Cys His 

20 25 30 

Ala Asn Val Thr Leu His 
35 

<210> 21 

<211> 54 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 13 

<223> Xaa^Gly or Ser 
<400> 21 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 

1 5 10 15 

Pro Gly Leu Val Lys Gly Asn Leu He Thr Gly Val Lys Cys His His 

20 25 30 

Leu Ser His Val Pro Leu Val Arg Ser Pro Ser Ser Cys Pro His Ser 

35 40 45 

Arg Gly Phe Val His Leu 
50 

<210> 22 

<211> 28 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 13 

<223> Xaa-Gly or Ser 
<400> 22 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 
1 5 10 15 

Pro Gly Leu Val Lys Gly Phe Cys Ala Ser Leu Arg 
20 25 

<210> 23 

<211> 71 

<212> PRT 

<213> Homo sapiens 

<220> 

<221> VARIANT 
<222> 13 

<223> Xaa^Gly or Ser 
<400> 23 

Met Ala Ala Tyr Phe Phe Lys Pro Val Arg Arg Ser Xaa Leu Ser Trp 

15 10 15 

Pro Gly Leu Val Lys Gly Asn Leu He Thr Gly Val Lys Cys His His 

20 25 30 

Leu Ser His Val Pro Leu Val Arg Ser Pro Ser Ser Cys Pro His Ser 

35 40 45 

Arg Glu Asn Asn Ala Phe He Lys Ala Ser Ser Asp Val Thr Ser Asn 

50 55 60 

Gin Gin lie Ser Trp Thr His 
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65 70 

<210> 24 

<211> 427 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele . - . 

<222> 231 

<223> 8-102-231 : polymorphic base C or T 
<220> 

<221> misc_J)inding 

<222> 219. ,243 

<223> 8-102-231. probe 

<220> 

<221> primerjbind 
<222> 212. .230 
<223> 8-102-231. mis 

<220> 

<221> primer_bind 
<222> 232. .250 

<223> 8-102-231. mis complement 
<220> 

<221> primer bind 
<222> 1. .20 
<223> 8-102. pu 

<220> 

<221> primer bind 

<222> 408. .427 

<223> 8-102. rp complement, 

<400> 24 

cccacagaaa tatctaattg aaatattaaa ttgatttgga ttatctactc tcttgagaga 60 

gatggtataa'aaattcatta attataattt tatgtgcatg taacaaacag tcttttttgt 120 

tttcatggtt ta^gtacatg tcactgtggc aggtgtaagt gtgataattc agatggaagt 180 

ggacttgtgt atggtaaatt ttgtgagtgt gacgatagag aatgcataga ygatgaaaca 240 

gaagaaatat gtggaggtat gtatattggc tctgtagaaa ttaataaaag tacctgtttt 300 

tttcacctat ttcattgttc ttttatctct acataagttt ctgctctagg tgtgctaatg 360 

tagtttaaac ctatgtttat ctggaaaaca tttcacataa ggagagtgag actatggcat 420 

taagagt 42 7 

<210> 25 
<211> 310 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 105 

<223> 8-104-105 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 93. .117 

<223> 8-104-105. probe 
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<220> 

<221> primer Joind 
<222> 86. .204 
<223> 8-104-105 .mis 

<220> 

<221> primer bind 
<222> 106. .124 

<223> 8-104-105, mis complement 
<220> 

<221> primerjbind 
<222> 1. .21 
<223> 8-104. pu 

<220> 

<221> primerjbind 

<222> 295. .310 

<223> 8-104. rp complement 

<400> 25 ' 

gatggtgttt ttccttttta gaatattttt gtctgaatat atgtgttttt ttcaaacaca 60 

aagtaaacag aaaataataa gcgatatgtg aaatgctttc tttcrtggtt tttcttgact 120 

ggattcttat aaaattcttc ttcccaccta gggacttgtg tatgtggtga atgtacctgt 180 

cacgatgttg atccgactgg ggactgggga gatattcatg gggacacctg tgaatgtgat 240 

gagagggact gtagagctgt ctatgaccga tattctgatg acttctgttc aggwaagggc 300 

tcttccaatt 310 

<210> 26 
<211> 418 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 251 

<223> 8-94-252 : polymorphic base A or G 
<220> 

<221> misc_binding 
<222> 239- .263 
<223> 8-94-252. probe 

<220> 

<221> primer_bind 
<222> 232. .250 
<223> 8-94-252. mis 

<220> 

<221> primerjbind 
<222> 252. .270 

<223> 8-94-252. mis complement 
<220> 

<221> primerjbind 
<222> 1. .18 
<223> 8-94. pu 

<220> 

<221> primerjbind 
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<222> 400. .418 

<223> 8-94. rp complement 



<400> 26 

gtggtctatg caggtatgga tggacattct cacacctaac caagcaatgt ttgcttttcc 60 

tcctcctttc ttcccatcac tagttgccat gtaccgagaa ccatctttgc atgatgttgg 120 

ggaaacggtc ccgaagcctg gggtgacgcc aagtaaaagc acaagtgcgt ctgcaataat 180 

gaatggaggc aaaccagtca acaagagtaa gacaacatag ccagatcctc acaggtgttg ■ 240 

tgacttattc rtcctgagca cagttgagtg atttatcctc accagacatt cctgctccgt 300 

ggctgaagag cagcaggaag taagctaatg cttattcttt gctgtctccg aacttctctg 360 

ttgcaagtgg ataaatctca acctttgcac cccccacaac aagaagacac ctggataa 418 

<210> 27 
<211> 419 
<212> DNA 

<213> Homo sapiens 



<220> 

<22l> allele 
<222> 43 

<223> 8-95-43 r polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 31. . 55 

<223> 8-95-43. probe 



<220> 

<221> primer_bind 
<222> 24. .42 
<223> 8-95-43.mis 

<220> ; 

<221> priiRer_bind 
<222> 44.. 62 

<223> 8-95-43. mis complement 
<220> 

<221> priiner_bin.d 
<222> 1..18 
<223> 8-95. pu 



<220> 

<221> primer_bind 

<222> 400. .419 

<223> 8-95. rp complement 



<400> 27 

gactggaact 

gtgattccgg 

atgtggcaaa 

gtgtgatgat 

tcatagctgt 

cttgcttttt 

aaaagcaata 



attctaggag 
gactctcctc 
tgcacctgct 
cgccgctgtg 
atgctacctg 
caattgctgt 
tacaattagg 



atttgggttt 
ataagtgtgc 
atcctccagg 
aagacctcga 
catgcaactt 
taaaaataag 
tttgtaatga 



aatagtgtgc 
actttcttta 
agatcgccgg 
tggtgtggtc 
gctttacctg 
ataaacagaa 
cttttatttc 



tcrcctcctt 
tactttccag 
gtgtatggca 
tgtggaggta 
cctttgttat 
agcaatatac 
cctgactcct 



<210> 28 

<211> 350 

<212> DNA 

<213> Homo sapiens 



gaaagcctta 
gtaaatgtga 
agacttgtga 
gtaacctttc 
tctactgact 
cattgaacct 
ttatttagc 



60 
120 
180 
240 
300 
360 
419 
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<220> 

<221> allele 
<222> 181 

<223> 8-97-181 ; polymorphic base A or G 
<220> 

<221> misc_binding 
<222> 169.. 193 
<223> 8-97-181, probe 

<220> 

<221> primerjbind 
<222> 162.. 180 
<223> 8-97-181. mis 

<220> 

<221> primerjbind 
<222> 182. .200 

<223> 8-97-181. mis complement 
<220> 

<221> primerjbind 
<222> 1. .16 
<223> 8-97. pu 

<220> 

<221> primer_bind 

<222> 331.. 350 

<223> 8-97. rp complement 

<220> 

<221> allele 
<222> 98 

<223> 8-97-98 : polymorphic base C or T 
<220> 

<221> miscjbinding 
<222> 86. .110 
<223> 8-97-98. probe 

<220> 

<221> primerjbind 
<222> 79. .97 
<223> 8-97-98. mis 

<220> 

<221> primerjbind 
<222> 99., 117 

<223> 8-97-98. mis complement 
<400> 28 




tatagagcca agagacactc gtcatgcaat gctttagtgg gctgga* 



60 
120 
180 
240 
300 
350 



<210> 29 
<211> 434 
<212> DNA 
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<213> Homo sapiens 
<220> 

<221> allele 
<222> 68 

<223> 8-98-68 : polymorphic base A or G 



<220> 

<221> misc__bin.ding 

<222> 56.. 80 

<223> ' 8-98-68 .probe 

<220> 

<221> primerjoind 
<222> 49.. 67 
<223> 8-98-68 *mis 



<220> 

<221> primerjbind 
<222> 69. .87 

<223> 8-98-68. mis complement 
<220> 

<221> primer_bind 
<222> 1..19 
<223> 8-98. pu 

<220> 

<221> primerj>ind 

<222> 415. .434 

<223> 8-98. rp complement 

<220> 

<221> misc_feature 

<222> 414,416 

<223> n=a, g, c or t 



<400> 29 

atttcattgt 

taatcatrat 

tgaatgctgg 

ttaacaatta 

atgcgaagga 

ttctatttct 

agatgcaaat 

gagagcaatt 



ggacacttgt 
acctatatgt 
gatggatgga 
catgagagag 
aaccatgtat 
gaattacgaa 
aacattagaa 
tttc 



gatttataat 
attttattgc 
atggaaatgc 
gtctggattc 
attcaccact 
tgaaatccga 
aaaaaagatt 



agcatgttta 
agggaatgga 
atgtgaaatc 
ttattttttc 
aggacaggtt 
gtacctatta 
cttccataat 



<210> 30 

<211> 477 

<212> DNA 

<213> Homo sapiens 



aatttggcac 
atatgtagct 
tggcttggct 

tgggccatta 

aaaaagacca 
gaaatgagtt 
taacataagt 



attttgatca 
gtggaaactg 
cagaatatcc 
gaacatataa 
ttgtatgttt 
atgcaaattt 
ggtncntaac 



60 
120 
180 
240 
300 
360 
420 
434 



<22Q> 

<221> allele 
<222> 108 

<223> 99-14021-108 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222>_96. .120 

<223> 99-14021-108. probe 
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<220> 
<221> 
<222> 
<223> 



prime r_bind 
89., 107 

99-14021-108. mis 



<220> 



<221> primerjaind 
<222> 109. .127 

<223> 99-14021-108. mis complement 
<220> 

<221> primerjbind 
<222> 1..18 
<223> 99-14021. pu 

<220> 

<221> primer__bind 

<222> 465. .477 

<223> 99-14021. rp complement 

<4 00> 30 

tttgttgtta atcgcccctt ttctgcaaca cttgtgggtt agggaaaata attctaaagc 60 

aagagcaaag acagagttgg gagatcacca gtgaggttca attttccrtc acattcactc 120 

tgctccacac ctcagataat catgtgctta actgcgaaac ttgcttgaca attacagaac 180 

actttctcac ccattactac cttgatcctc acaattctgt ggggtagtag gagcagatgc 24 0 

tgaaattgcc atacgcaaat cagtgaactg aagcttagag acctccagca ggggcagagg 300 

gtcagcggaa actatcccag ggttcagcca acaagaaagt atattggaat cagagtatta 360 

aaataagaat aataaaacca actaaaattt accgtgcttt ttatttccac tcagtgccaa 420 

caattcttaa cagtgtcagt gatggatccc tgtgccccag ggrmcagact tcttact 477 

<210> 31 

<211> 473 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 415 

<223> 99-14364-415 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 403. .427 

<223> 99-14364-415. probe 

<220> 

<221> priiner_bind 

<222> 396. .414 

<223> 99-14364-415. mis 

<220> 

<221> primer_bin.d 
<222> 416.. 434 

<223> 99-14364-415. mis complement 
<220> 

<221> primer_bind 
<222> 64. .82 
<223> 99-14364. pu 
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<220> 

<221> primer_bind 
<222> 434.. 454 

<223> 99-14364. rp complement 



<4 00> 31 

catgcccatc 

aacttcaaca 

ataagatacg 

gtgctattgc 

aaatactcaa 

tttggctgat 

actagaattt 

tattgcttgt 



ctaaatattt 
gcatctctag 
aaatgataga 
caataataca 
gaaaataggc 
tcagtcagtt 
tcctattact 
caaatttcag 



acatgccctg 
tacaactgtt 
attgctctag 
tgctgtacct 
cacatagtta 
gctttcaagc 
attaatttta 
cagaagtaca 



acagataagc 
gctcattctg 
attctctatt 
ggataaaccc 
caacaggact 
ctaggttctt 
ctttgtattt 
actcctgaga 



ccttagcctg 
tgttttaatt 
ggttaaataa 
ctttgggcaa 
tacctaattc 
ggctcaatat 
ttctttataa 
gaataatgct 



cccaatgggt 
caacccagct 
ggagatattt 
gttgtgatgc 
cccatggtca 
tattacataa 
acttygtact 
ggc 



<210> 32 

<211> 502 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 99 

<223> 99-15056-99 : 



polymorphic base C or T 



<220> 

<221> miscj^inding 

<222> 87 ..111 

<223> 99-15056-99. probe 

<220> 

<221> primerjbind 

<222> 80. .98 

<223> 99-15056-99.mis 

<220> 

<221> primerjbind 
<222> 100. .118 

<223> 99-15056-99. mis complement 



60 
120" 
180 
240 
300 
360 
420 
473 



<220> 

<221> primer bind 
<222> 1. *18 ~ 
<223> 99-15056. pu 

<220> 

<221> prime rjoind 
<222> 482. .502 

<223> 99-15056. rp complement 



<400> 32 

caggaaactc 

cttcttaact 

tctccacgta 

tatatcaaac 

aattttcttt 

ttaggaataa 

gtttcgatga 

ctcaagttat 

ccccaaaata 



acaagaagsc 
cttacaattt 
gacttcctaa 
aagttaccac 
gcctcttatt 
gcttgtgatc 
aacagtgagc 
tgtaaacata 
cgtctccctt 



agatttcctt 
acaagtggct 
ataataattt 
ttaatcaaat 
tttggatctt 
tagccttctt 
aacatttgat 
ggcataattg 
tt 



cgagcacctc 
atgagtgcyt 
ctcctgtttt 
gccgatttgc 
aattctaaaa 
ttgaacccct 
ctatgattgt 
acctttggtt 



ctgaataaag 
ttatagttcc 
atattctctg 
attgctcact 
ctagatgatc 
ttgtgctcct 
taatagaaaa 
ctataaatat 



aggcaaaggc 
cataataatt 
tgcttatgtt 
atgtaacttt 
ataaattcat 
cacaatattt 
acaccaatgt 
gtttggtgtt 



60 
120 
180 
240 
300 
360 
420 
480 
502 
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<210> 33 

<211> 4 94 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 412 

<223> 99-15229-412 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 400. .424 

<223> 99-15229-412. probe 

<220> 

<221> primer_bind 

<222> 393.. 411 

<223> 99-15229-412 .mis 

<220> 

<221> primerj>ind 

<222> 413.. 431 

<223> 99-15229-412. mis complement 
<220> 

<221> primer_bind 

<222> 1. .20 

<223> 99-15229. pu 

<220> 

<221> primer_bind 
<222> 476. .494 

<223> 99-15229. rp complement 
<400> 33 

ctgtcattga gaaatgctac caataatact tagagaattt gatacaactc agtctgaaaa 60 
agctaagatt agcagaacag agctgtctcc aaatatttga agaactattt tatttaaggg 120 
attggaccca tttttgtatg tagttccaga ggagcagatg gtgaccactg tccaggcaga 180 
tgtgtctcaa tgtaaggaca acatctgtaa tattaataat tagaatgtat cctgtaattt 240 
tctctctacc cttggaaacc agtcgagatc cagagtcttt cactgggagg cttaaagcct 300 
agagcagcct tggtgctaga ggcggacagg gataatgaac taatcttgaa ccaattcatc 360 
catagcaatc tcaatgcttt cgttagctct tataggtatt taatacggcc araggaatga 4 20 
aggtagtctt gctggtttag aagccctgcc taccacaacc cctacaccac cccatcccct 480 
gcatagtctg atgt 494 

<210> 34 

<211> 485 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 291 

<223> 99-15232-291 : polymorphic base G or T 
<220> 

<221> misc_Jbinding 

<222> 279.. 303 

<223> 99-15232-291. probe 
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<220>. 

<221> primer__bind 

<222> 272. .290 

<223> 99-15232-291. mis 

<220> 

<221> primer_bind 
<222> 292. .310 

<223> 99-15232-291. mis complement 



<220> 

<221> primerjoind 
<222> 1. .18 
<223> 99-15232. pu 

<220> 

<221> primer_bind 
<222> 467. .485 

<223> 99-15232. rp complement 



<400> 34 

caatagaaca 

atacgtggct 

attcagactt 

atataaattg 

cctagttaag 

tagtgggtgc 

caggaggtgc 

tcacatggtg 

gccag 



ggctgctcct 
agaaaaaaat 
ctttgaggag 
agagaataaa 
actatttaag 
cttctgaagg 
ctggcattcc 
tcccctgtgt 



ttataattat 
ttagaacaaa 
agcttttatt 
cagacaaaca 
actattatgc 
atctgaagga 
ttggcttgta 
gtgtgttttt 



taatcatagt 
aagatatgtg 
ttattgttat 
ttacaaatta 
ataatacagg 
gaatcagttc 
gacgcatcac 
gccccatgtg 



gtatattaat 
atatgtaaag 
tcttatttta 
gtgattaatt 
aaaactacct 
tatgcctctc 
cctaatctct 
tctcctcttt 



tcatcatcac 
gcctacgata 
tctcttgtca 
gcatttaaag 
ktattattta 
tcctcattcc 
acctctgcct 
ttatatggat 



<210> 35 

<211> 477 

<212> DMA 

<213> Homo sapiens 



60 
120 
180 
240 
300 
360 
420 
480 
485 



<220> 

<221> allele 
<222> 382 

<223> 99-15253-382 ; polymorphic base C or T 



<220> 

<221> misc_binding 

<222> 370.. 394 

<223> 99-15253-382. probe 

<220> 

<221> primer Joind 

<222> 363.. 381 

<223> 99-15253-382. mis 



<220> 

<221> primer_bind 
<222> 383. .401 

<223> 99-15253-382. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15253. pu 
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<220> 

<221> primer_bind 
<222> 459.-477 

<223> 99-15253 .rp complement 



<4O0> 35 

aaaatcaatt 

ctattttatg 

ttttttttac 

tccataatgt 

tcatcttttc 

aaataaataa 

tgcacagcac 

attacacagg 



ccccaacact 
gtttcatgtg 
aatattttat 
gagtggtaaa 
cttaattttt 
attactttgt 
agcctgtaat 
tcctcattgt 



cattttgtac 
ttacagatga 
gattacaaag 
tgcagagcac 
ttccaaccta 
aaataatctt 
tyctatgaac 
gtaactaaac 



gctaattttg 
aaaaaaaact 
ttagaagact 
tggcctaaag 
tgatctgttg 
gactggtttc 
ctagggaaat 
attgcataga 



taagatcctg 
agaattcaaa 
aagaataaaa 
aaaatatttc 
aatgagcatt 
tgttgaccac 
gtatttaagt 
atatgccagt 



aaaagtttca 
ttttctgagt 
tggcctaatt 
aaaaaattag 
ttgcatatat 
agtaacccac 
ttattttttg 
gatgatg 



60 
120 
180 
240 
300 
360 
420 
477 



<210> 36 

<211> 456 

<212> DMA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 392 

<223> 99-15256-392 : polymorphic base C or T 
<220> 

<221> miscjbinding 

<222> 380. .404 

<223> 99-15256-392. probe 



<220> 

<221> primer_bind 

<222> 373.. 391 

<223> 99-15256-392. mis 

<220> 

<221> prime r__bind 
<222> 393.. 411 

<223> 99-15256-392. mis complement 
<220> 

<221> primer bind 
<222> 1. .18 ~ 
<223> 99-15256. pu 

<220> 

<221> primer_bind 
<222> 439.-456 

<223> 99-15256*rp complement 
<400> 36 

cctctctatg atgcttccta ttaagcaatt ggggaaatgt aataaacaag ggttggtgag 60 
catcttcctt agtgagatgt ttttggaaga attggataat tgagtgaata atagtgagaa 120 
actcctgtgt ctgatgttgc tccatgttgg aatgctttta tgttctcaga gaatgagtca 180 
ctgagagcca attgtgatga tacacaatgg ttttacccag gttggatatg gtcctctgta 240 
ctggtaccct ttaagtcagt ggcactaatc agtcagtcat tgtcatgctt tgtgttggtc 300 
catcatatgg tatgccctct tagagaacat cctgattagt ccttagacat cttttcaatt 360 
tgaacactgg ggctcctcat tcgggtaaaa aytatggaca gtcagtgaaa ctgttgcaat 420 
ggcccctcat agcagattgg atctcaatgc actttg 456 

<210> 37 
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<211> 541 
<212> DMA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 337 

<223> 99-15258-337 : polymorphic base g or T 
<220> 

<221> miscjbinding 

<222> 325.. 349 

<223> 99-15258-337. probe 

<220> 

<221> prime rjbind 

<222> 318. .336 

<223> 99-15258-337. mis 

<220> 

<221> primer bind 
<222> 338.. 356 

<223> 99^15258-337. mis complement 
<220> 

<221> primer bind 
<222> 1..19 
<223> 99-15258. pu 

<220> 

<221> primer bind 
<222> 521.. 541 

<223> 99-15258. r P complement 
<400> 37 



«K K££y t c T taata9 atcagtaaat 

aaaatttaca aactgatgtg atttlttlTc cS 9 ! gtg ^ cctt ^Sgaaatac 

aaaaaatata aatgggggc! aotc^cL 9 ?** tcttat 3aaa gattaaatta 

tactagggaa aat?Sgaag tagaatSa tatattccat gtgactgaaa 

totacaggat gaaggaagaf ttgccca^at gccaltkocc t^* 9 "! tct 9 aaa 9^ 

. gaaatcaata aggcatgtat ggaaatacaa Ittntt^ 9 cacaaaaaca 

agggatcagt tfacacaaac aajaagagtt SSccSa? aagagcgtga 

aaaattaage acttcccagt atL^ag lT g llT^ ~? c l l^SS 

<210> 38 

<211> 500 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 200 

<223> 99-15261-202 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 188. .212 

<223> 99-15261-202. probe 



60 
120 
180 
240 
300 
360 
420 
480 
54 0 
541 
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<220> 

<221> primer_bind 

<222> 181. ,199 

<223> 99~1526l-202.mis 



<220> 

<221> priiuer_bind 
<222> 201. ,219 

<2 23> 99-15261-202. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15261. pu 

<220> 

<221> primer_bind 
<222> 481. .501 

<223> 99-15261. rp complement 



<400> 38 

cttctaatcc 

attccatttt 

tttctttctt 

ctttccattt 

gtcttacttt 

aaatacttta 

tggggttttt 

ttgtttaaaa 

gtagacatta 



tttgtttcca 
attatactta 
agtcttctta 
cttcaccgar 
gccatttttg 
tcatggtggg 
ttgtttgtgt 
attctggacc 
gcagtattta 



cttattttat 
aggtgctttt 
tataagcgct 
cctctttgct 
aaatttattt 
aggtacccgt 
gttttttgaa 
aaatttacca 



ttcattcctc 
aatatggtta 
atacgttcac 
ctcttttttt 
tcattctttt 
gatgtccaaa 
aggttaagaa 
cacatcaagc 



attttatccc 
tcatactcct 
attccatctc 
atagctggtt 
atgtactgaa 
taagtgttta 
aatctcattc 
agatacttac 



ttttttctaa 
gatagtgtta 
ctttggttat 
cactcaaaat 
taaaatttaa 
tattaattgt 
agaaagtaag 
caagttgttt 



60 
120 
180 
240 
300 
360 
420 
480 
500 



<210> 39 

<211> 541 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 432 

<223> 99-15280-432 



polymorphic base C or T 



<220> 

<221> m±sc__binding 

<222> 420. .444 

<223> 99-15280-432. probe 

<220> 

<221> primer_bind 

<222> 413. .431 

<223> 99-15280-432. mis 



<220> 

<221> primer_bind 
<222> 433. .451 

<223> 99-15280-432. mis complement 
<220> 

<221> primerjbind 
<222> 1. .18 
<223> 99-15280. pu 
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<220> 

<221> priraer_bind 
<222> 521.. 541 

<223> 99-15280. rp complement 



<400> 39 

atgtccatcc atcttgccca gagagagttt ctacaacact Hcctctgcaa gccctttccc 60 

tacttgcctc acctattgct ttcctctgtt acgttgtiatt. cccctcactg tttcttccaa 120 

catettccca cctcagagca tggacacttg ctgctctttc tgtgtcatga tgctgctcac 180 

ttgtcccttt cttaatgtct cctccctgag ccaatcttct ccacccccac aacttacgca 24 0 

cacttacatg tcatattttc cttcatagcc tttaacacca tttgaaatga tatatatttg 300 

attgctttta aaatttctct gtccccccac taaatataaa cttcaggatg gcaagaatgt 360 

agtccattat cttatttctc cagcctccat acttttaaga aaataaattt tggttgtata 420 

agccatccag tyagtggtac ttggttatag cacccctagc aaaagaatac aaaaaaaggg 480 

agaatgtttg caatcatctg tttgaggcta ggaattccca gagaggraarn caaggagtaa 540 

t 541 

<210> 40 

<211> 489 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 150 

<223> 99-15355-150 : polymorphic base C or T 



<220> 

<221> misc_binding 

<222> 138.. 162 

<223> 99-15355-150. probe 

<220> 

<221> primer_jDind 

<222> 131. .149 

<223> 99-15355-150. mis 



<220> 

<221> primerjbind 
<222> 151. .169 

<223> 99-15355-150. mis complement 



<220> 

<221> primerjbind 
<222> 1..18 
<223> 99-15355. pu 



<220> 

<221> primerjoind 
<222> 471. .489 

<223> 99-15355. rp complement 



<400> 40 

taacttctcc 

catctttaaa 

ttcatcttct 

tggatctgct 

gttcttgcct 

ttcattacaa 

tcaaaatttc 

tttattctct 



gtctctcctt 
ctgcctattc 
actttgggca 
agcacttctc 
gtaaaatggg 
acttcctcat 
tgtagattat 
catcattatc 



cttagcccat 
cagccacctc 
aaaggaaaty 
tgtgtgtcct 
gataataaaa 
tca'aaatctt 
tgagacaagt 
aacaactact 



atgtcaataa 
ccacctccat 
gatgtgtcag 
tggttatatg 
acctcttaac 
caatgatttc 
cccctactct 
aggctttgtt 



tgactgaaag 
ctctttcctt 
acaggcctag 
atatagtctt 
agtggttgtt 
catttttcac 
tcacctaaat 
gccttgactc 



tattcatttc 
ctaagttttc 
ttttgaattc 
aaaccttaat 
tcatgcagct 
aaaatgaaat 
ttatctttta 
cagaggcaaa 



60 
120 
180 
240 
300 
360 
420 
480 
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aatcttatc 

489 

<210> 41 

<211> 450 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 298 

<223> 99-15663-298 : polymorphic base C or T 
<220> 

<221> misc__binding 

<222> 286. *310 

<223> 99-15663-298. probe 

<220> 

<221> primerjsind 

<222> 279. .297 

<223> 99-15663-298. mis 

<220> 

<221> prime r_bind 
<222> 299. .317 

<223> 99-15663-298. mis complement 
<220> 

<221> primer__bind 
<222> 1. .18 
<223> 99-15663. pu 

<220> 

<221> primer Jbind 
<222> 430. .450 

<223> 99-15663. rp complement 
<400> 41 

™^^ C ttctaaac ^ gttgcttcaa tacgttgata ggtgaggaca cttaaaaatt 60 
tt™ I tt ? a f ata 93tt tttttttgtt tacatatata gttcttttgg tatcatatat 120 

n t tC ^^ at r atttttt * at actgaaggga gaaataggga gttattaatc 180 
aacaggcatt aattttagtc aagcaaaata aataagctgt agcgatctgc tctgtaacat 240 

°£ aCa ^ ccaacaatt atatgttgtc cacttaaaaa tgtgttagat ctcatagyaa 
ctcttcttac cacaataaag taaaaattct gaaacaataa gtgaatacct aaataataca 
aacaaataca atattgtagt tttgggcact taataaatga cagcctcatt tctcaattag 
agatcatcac aagttagaca gatgacgatg 

<210> 42 
<211> 502 
<212> DNA 
<213> Homo sapiens 



300 
360 
420 
450 



<220> 

<221> allele 
<222> 185 

<223> 99-15664-185 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 173. .197 

<223> 99-15664-185. probe 



157 



WO 01/40493 



PCT/IBOO/01S07 



<220> 

<221> primer_bind 

<222> 166. .184 

<223> 99-15664-185. mis 

<220> 

<221> primer_bind 
<222> 186. .204 

<223> 99-15664-185. mis complement 
<220> 

<221> primer_bind 
<222> 1..19 
<223> 99-15664. pu 

<220> 

<221> primer Jbind 

<222> 4 83. .502 

<223> 99-15664. rp complement 

<220> 

<221> misc^f eature 
<222> 54 

<223> n=a, g, c or t 



<400> 42 

gtttaccatt agcactgtca tatttgtgtg acttgtcatt ctctacagcg gagnacgggc 60 

tggcacgggg cctgatgctg acttgcacaa gggaagcctc ctgtctctga cttccccagg 120 

ataattcctg gggaaagtgt gctccctagt gttaagagcg gtttaatggc tggagggttt 180 

cagckggctg accaggcaga gaaggagggt gaatcacctc tcagcactct ccacttagac 240 

tttgtgtggt cgtcgggtgg tcaaaccttc taactagttg tattgcagat ttggcattcc 300 

agtgcaaaca aaagacagaa acacaatgtt cacatgcttt ccagagatca cctgga'tatc 360 

agatcatttg attttcaagt aagtcgaaac cttggtggaa atcattaact atcctgttta 420 

tgaccaaaaa ataaaatccc aaatttcttc tcttcatttc ttacctgctt taaaattgta 480 

tccaaagcgt gaaagtaaaa ga 502 

<:210> 43 

<211> 476 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 398 

<223> 99-15665-398 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 386. .410 

<223> 99-15665-398. probe 

<220> 

<221> primer_bind 

<222> 379. .397 

<223> 99-15665-398. mis 

<220> 

<221> primer_bind 
<222> 399. .417 

<223> 99-15665-398 .mis complement 
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<220> 

<221> primer_bind 
<222> 1. .20 
<223> 99-15665. pu 



<220> 

<221> primer_bind 

<222> 458. .476 

<223> 99-15665. rp complement 

<400> 43 

cgtaaatgtg aaaagcatag cctcttcttg gaatgttaag tataaatatc tgaaatactg 60 

ggcttgatat gtcaacagga gattgatgga taaaaataga attttatata aaaaacaact 120 

ggacatatta gattgttaac ttggaagaaa gaccatattc aaagaagaaa acatagtgac 180 

taatttcaaa catttaaagt cttccctgtg gaaacaaagg aatatctttg ttctaacact 24 0 

tcaaagaaca gggttaaaaa atagactcac cacagagtaa atgcacaatt gacaatcgtg 3.00 

aatgaattaa aaaccaaaca aaatattttg tcagctttct atctatgaaa ctaagaaaca 3 60 

ggcttcctac taaggtaatg aatgtaattc acagagarca ttcacgtata agtttcattc 4 20 

atgttteaaa tttcattgat ttgatcaatg ggttattcta ataccctccc ttattt 4 76 

<210> 44 : 

<211> 459' 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 139 

<223> 99-15668-139 t polymorphic base C or T 

<220> . 

<221> misc_binding 

<222> 127, .151 

<223> 99-15668-139. probe 

<220> 

<221> prime r_bind 

<222> 120.. 138 

<223> 99-15668-139. mis 



<220> 

<221> primer_bind 
<222> 140.. 158 

<223> 99-15668-139. mis complement 



<220> 

<221> primer_Jbirid 
<222> 1. .18 
<223> 99-15668. pu 



<220> 

<221> primer_bind 

<222> 439. .459 

<223> 99-15668. rp complement 



<400> 44 

agaggtgaga gaaaggtgaa ttgtgcaaat atggacatcg aagttattca gacacatatg 60 

gtttccatgt cgtgagacct taaagcactc tcagaaactc caagaaagct gttgattggt 120 

cagatttctc acttttgtyc cctcaaaaca agattgtcct gaacttatag gaaatatact 180 

caaaagtagt atgatttaag gaagtcttcc aaatacctgc ctcgtttaca gataatcaat 240 
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ggaattccaa aagattttat gtgttttaag ccctaaaaat gcaaatatcc aggaaagggc 300 

tagcactaaa cagagctgg.c atgtatgcct cagtgcaggt gtatatctca gtgtctcctg 360 

aagccagggc cttagaggat tgtggaaata ggtacactga cagcagggac tggaagccac 420 

agtctggaga gcaaagagca gaatccatac ttacacctc 459 

<210> 45 

<211> 547 

<212> DMA 

<213> Homo sapiens 

<220> - * 

<221> allele 
<222> 166 

<223> 99-15672-166 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 154.. 178 

<223> 99-15672-166, probe 

<220> 

<221> primerjDind 

<222> 147. .165 

<223> 99-15672-166. mis 

<220> 

<221> primerjDind 
<222> 167. .185 

'<223> 99-15672-166. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-15672. pu 

<220> 

<221> primer_bind 
<222> 533. .547 
<223> 99-15672. rp com] 

<400> 45 

ccaataccat aactcctcta 
tgattattat tatcagtgtt 
acatctgtgc ctgtgcgtat 
aagtctagct gtatttaatc 
taaaacatgt aaactctaag 
tctggtgtga atattctctt 
gtgtttgaaa gagacatatg 
catgtacaga agttctcata 
taatagactg tcaatatttc 
aaaaggg 

<210> 46 
<211> 500 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 318 

<223> 99-15682-318 : 



pi erne nt 



taggacatgg 
attattatcc 
gtgcacgtgt 
aacctacctg 
ctctctcctt 
agttctacat 
tgacactgct 
ttactttaca 
taggataact 



aagagtatta 
taatcctaag 
gtgcagtcaa 
aatcctcagg 
atagtcagtt 
gagtctaact 
gttttgagaa 
taaatggttg 
ccaaaacaaa 



tatatgacaa 
taatccaata 
atacaygttg 
aaaaaattct 
agtagcagca 
aaacagagga 
caatttaagt 
cataattgtt 
atttcctaga 



atgattgcta 
aaagaaaaat 
agtaaaggta 
aaacctagtt 
catcttaaaa 
ttattcttag 
gttgtcttgt 
ttatagtaaa 
mmacattttg 



60 
120 
180 
240 
300 
360 
420 
480 
540 
547 



polymorphic base A or T 
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<220> 

<221> misc_binding 

<222> 306.. 330 

<223> 99-15682-318. probe 

<220> 

<221> primer Jbi'nd 

<222> 299. ,317 

<223> 39-15682-318. mis 

<220> 

<221> primer bind 
<222> 319. .337 

<223> 99-15 682-318. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-15682. pu 

<220> 

<221> primerjsind * . 

<222> 482..500 

<223> 99-15682. rp complement 
<400> 46 

gtttgatgag caacaagtgg ccacacaagt attggagaaa aaaaaaatta gaatgtgggg 60 

agagaggcag tagacacaat aaaaaattct atataaeaaa tgctatatct ttaagaaatc 120 

gaatctttcc attaaatatt tgtttaaagt tacagactac ttagatgtta aagtatcact 180 

tatcacaaag atcagtaact tatttcctct gtatagtaga tgcctcttct gtagaattaa 240 

catttcacta taatctctaa gatacagttc aagggcacaa ttttataatt caaagatatt 300 

tacggcatga aattgagwtc taaaactctt gtatctttaa caaccatttt gtaattaaat 360 

aactgttgac atttacaatc attgaagaac tatttgaact ctgtagagta tagatattga 420 

agtacaaaat taattatgag ataatatatg gagcctgkcg gctttatttt ttaataagtw 480 
tgtatgtatg ccaataaagg 500 

<210> 47 
<211> 4 61 
<212> DKA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 217 

<223> 99-16081-217 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 205. .229 

<223> 99-16081-217. probe 

<220> 

<221> prime rjoind 

<222> 198. .216 

<223> 99-16081-217 .mis 

<220> 

<221> primer_bind 
<222> 218. .236 

<223> 99-16081-217 .mis complement 
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<220> 

<221> prime rjsind 
<222> 1. .18 
<223> 99-16081. pu 



<220> 

<221> prime rjbind 
<222> 443. .462 

<223> 99-16081.rp complement 



<400> 47 

caaaggacag 

gggaggaggg 

agggagttct 

aaagaaaaca 

agttctagcc 

tggcttttat 

ggaatacttg 

ggttggcaat 



agaaccagca 
cggagtcgga 
ccatgaagtt 
ttaggcactg 
ataataaagg 
acagtatcat 
gaatacacat 
taagaactat 



ggcaggccct 
agggagcgag 
ccagaagcca 
gtggcttcgc 
catttgaatt 
gccagaaaca 
ttttcagctg 
aacttactcc 



gggtcaagcc 
ggagccaggg 
atgaccagat 
tctaaaygaa 
atgttgcttt 
aatttataac 
agctctaaga 
accccactcc 



ccgtgatgtc 
aggcagggcc 
cggcgcatgc 
cagttcttaa 
tgatgaaact 
attgttatgg 
attacttttc 
a 



ccaggaggag 
ctctgagggc 
aaacagctta 
aactgctttc 
ataaagatat 
tataacagca 
ccttgcccca 



60 
120 
180 
240 
300 
360 
420 
461 



<210> 48 

<211> 532 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 218 

<223> 99-16082-218 ; polymorphic base A or G 
<220> 

<221> miscjbinding 

<222> 206. .230 

<223> 99-16082-218. probe 

<220> 

<22I> primer__bind 

<222> 199.. 217 

<223> 99-16082-218 .mis 



<220> 

<221> prime r_Joind 
<222> 219. .237 

<223> 99-16082-218. mis complement 



<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-16082. pu 



<220> 

<221> primer_bind 

<222> 512. .532 

<223> 99-16082. rp complement 



<400> 48 

caattcagca agagccactt tggttaacaa accttctggg aatgaaggtg ctttacaaag 60 

aagcaggctg tgataaaaac aaacaagaca acattcataa actgtgggaa gcattgttgt 120 

cattgggtca cacctcctgc accaggcagg tggttgtctg ctgcgttcct ttacagaaca 180 

gatgctgagc ctgcctggag gtggcaattt accaatcrat ggtcttaagc aattccaaag 240 
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gctggggtaa ggggtagaaa agagaagctc aaaaatgtta gggatggaga aggggggagt 300 

gattcaggaa gaggagaaac ctgttcattt cctcccaaat tacaaccact gtcacatcca 360 

tggctagtcc tgtgcaaact tccagtcccc agctgatatc cctgcagaca agtagagaga 420 

ggctagaaca aaagctaaag tgtagatgtc ccatacacta atcagtgttg. ttttttaatc 4 80 

agaggttgaa attcatactc tgatgcaaac tgtattagtc agwtgttgct gc 532 

<210> 49 

<211> 4 51 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 

<222> 81 

<223> 99-20933- 



81 : polymorphic base A or C 



<220> 

<221> misc_binding 

<222> 69. .93 

<223> 99-20933-81. probe 

<220> 

<221> primer_Jbind 

<222> 62. .80 

<223> 99-20933-81. mis 



<220> 

<221> primerjaind 
<222> 82. ,100 

<223> 99-20933-81. mis complement 
<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-20933. pu 

<220> 

<221> primer__bind 

<222> 432. .451 

<223> 99-20933. rp complement 



<400> 4 9 

ttccattcac 

tgcttgtata 

gagatatatt 

gctctttctg 

tactgctaga 

gaatcgggtg 

cctagawaac 

aatttctggg 



atgagaaact 
ttttcagtaa 
tgccaagtca 
ggcctgagta 
cgtatttcct 
cttccagaaa 
atgaacatga 
agaacatttt 



atgtgtttac 
mtgctggggt 
acagtctcca 
atcagtaaat 
tacatctgaa 
atcctgtaaa 
tcccatgtct 
gtagaagaca 



tatttctgtt 
agcaagtgtc 
gactattgtt 
atataaaaag 
cattttcttt 
aatcatgtag 
tatctttact 



atgttttgta 
agaatcactc 
tcagcaagac 
aataagaata 
catttgcaac 
cttccaagat 
. catacatcag 



tgattttgtg 
caaaaaaact 
aaggggcaag 
atagtgaagt 
acttaacctg 
ttaaagctgg 
gaattctcta 



60 
120 
180 
240 
300 
360 
420 
451 



<210> 50 

<211> 511 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 71 

<223> 99-20977-72 : polymorphic base A or C 
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<220> 

<221> misc binding 
<222> 59. .83 
<223> 99-20977-72 -probe 

<220> 

<221> primerj>ind 
<222> 52* .70 
<223> 99-20977-72. mis 

<220> 

<221> primer__bind 
<222> 72. .90 

<223> 9 9-20977-72. mis complement 
<220> 

<221> primer_bind 
<222> 1..18 
<223> 99-20977. pu 

<220> 

<221> primer__foind 
<222> 491. .511 

<223> 99-20977. rp complement 
<400> 50 

gtacatggta ggtggtagat 
ctaaaacatg maggcctaaa 
gttttcactc tccatgttac 
atatcatatg gtaaatgaca 
agctattaaa atttgaagca 
actcatgact gaaatctcaa 
ttttttaaga taattattct 
gtcaggcact gtgagaaccc 
caatgcagct ccaataacta 

<210> 51 
<211> 466 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 89 

<223> 99-20978-89 : polymorphic base G or C 
<220> 

<221> misc_binding 

<222> 77.. 101 

<223> 99-20978-89. probe 

<220> 

<221> primerjbind 

<222> 70.. 88 

<223> 99-20978-89. mis 

<220> 

<221> primer_bind 
<222> 90. .108 

<223> 99-20978-89. mis complement 



ggtgaaataa tttttatatg taaggagttt caaagatgat 60 

acaatagatg aaacactatt. tcaaaatgta ttctctcttt 120 

aaaaatggca acaaattacg taaatgccaa ggaagccaat 180 

aatcaactgt aacattgaaa caacaattaa tagctattat 240 

aaattgtgca tctttaaaaa ttata.cacac atgtgagagg 300 

tataccatct atccaagttg agttattttc cttttttttt 360 

tcttcctact tctgggaggg acagtgta'tg ggaggtaect 420 

tgggctctga acagatatgg ggtatttctc tcagtatcct 480 

atcccatttc t 511 
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<220> 

<221> primer_bind 
<222> 1. .19 
<223> 99-20978. pu 

<220> 

<221> primer_bind 
<222> 446. .466 

<223> 99-20978. rp complement 



<400> 51 

acttaactgc 

cactaaaaat 

agattaactg 

caaagcagta 

catctcactt 

aaattgctag 

atcttccgca 

ctctcttagg 



tacctaatge 
aaaatgcaaa 
ccactgtgtc 
acgcaaaggt 
ttccattggc 
ttcagtacta 
tctctcgctc 
tggtcttccc 



agcctgtaca 
tcaaatctst 
taaacccaaa 
tagaaaggat 
cacatttcca 
taagtttaaa 
tttttctttt 
acctgcagtt 



agcggaataa 
ttaaaaaacc 
tcctgctagc 
caaagatctt 
atacatctct 
acaaatgggt 
tttctatttt 
ttccttttca 



ttgaactaaa 
aaaaccaact 
cagagctgaa 
gaaaaacctc 
gccccattaa 
tgctcaattc 
cactgccagc 
aatcct 



atgtcttcaa 
tgtattttga 
tttgtgatta 
ttctcagatc 
aaccttacca 
cttttgatgc 
ccctcattac 



<210> 52 

<211> 449 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 300 

<223> 99-20981-300 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 288. .312 

<223> 99-20981-300. probe 

<220> 

<221> primer_bind 

<222> 281. .299 

<223> 99-20981-300. mis 



60 
120 
180 
240 
300 
360 
420 
466 



<220> 

<221> prime r_bind 
<222> 301.. 319 

<223> 99-20981-300. mis complement 



<220> 

<221> primerjoind 
<222> 1..18 
<223> 99-20981. pu 



<220> 

<221> primer_bind 
<222> 429.. 449 

<223> 99-20981. rp complement 
<400> 52 

tttgccatcc ctttgctgat ggtttgattt ggaatgtgtc atttcgtttg tgaaatgaga 60 
catagtcaag gtacccatgt gctcaagtgc ttagattaca acaatgtttt gatgcaaatg 120 
tgcaaagtac tcagagatca acactatggt ctaataacag gccatagcaa aatagagcaa 180 
atgaatcctg tgaaatcaga gagaatgcat caaataaaga tgtactacct aaagtgcaaa 240 
atctactagg agcactttga catttcagtt caagttctac agtgtttcac attttttcar 300 
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attatgatct tgtttgtgL gaagttJc tCttgtgaat ^cactagat atgaattagg 420 

449 

<210> 53 
<211> 449 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 48 

<223> 99-20983-48 : polymorphic base A or G 
<220> 

<221> misc binding 

<222> 36. .60 

<223> 99-20983-48. probe 

<220> 

<221> primerjbind 

<222> 29. .47 

<223> 99-20983-48. mis 

<220> 

<221> primerjbind 
<222> 49. .67 

<223> 99-20983-48. mis complement 
<220> 

<221> primerjbind 
<222> 1. .19 
<223> 99-20983. pu 

<220> 

<221> primerjbind 
<222> 431. .449 

<223> 99-2098 3. rp complement 
<400> 53 



60 
120 
180 
240 
300 



sjsss srsxs skk? r awte - 

agagtggata agactt?gat ga^cttgala at c ^tcagaacac tatcttatgt 

tttctttcct tgattaafga !SS«SS KSSSf SSST 
agaggtgatg coagccoaca gagatgacca Jtccctat^ ^ 9 5? tS gata ^ ta ^ 
ttaagtgaag aagtatgctg gtctgctttc SactS^ = a ^gttct gagttgttgc 
aacagttaga aaatccaaa? atttaat^tc ► 9 aaacttca 9C ttagtggtag 360 

accaactctg taccatcaoc ttggSaa" aatgcct ^ tagtccattt actgattttc 420 

<210> 54 
<211> 466 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 148 

<223> 99-22310-148 : polymorphic base C or T 
<220> 

<221> miscjbinding 
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<222> 136, .160 

<223> 99-22310-148. probe 

<220> 

<221> primer^ bind 

<222> 129. .147 

<223> 99-22310-148. mis 



<220> 

<221> primer_bind 
<222> 149. .167 

<223> 99-22310-148. mis complement 



<220> 

<221> primer_bind 
<222> 1..19 
<223> 99-22310. pu 



<220> 

<221> primer_bind 
<222> 446. .466 

<223> 99-22310. rp complement 
<400> 54 

gtgtgatacg agatcattct ttttttaaaa 

gttgaaagct accgatatag agcaaagctt 

tgtgaaatga ctcctgaagg caccaaaytg 

taaatgtgtg cttgtatttt actgaatccc 

atattattgc aatacaaact gggaaagata 

atagcaatat ctcattgatt tagagcagag 
tcgatgactg gcagagaagc- gggtaagaat 

tgctaccctg ttgggacaaa gctctctaaa 

<210> 55 

<211> 471 

<212> DNA 

<213> Homo sapiens 



gctccctagg gtttcagaag tgcagcacag 60 

tactaaaagc tgatatgtgg aagcaaagaa 120 

acaatgcaaa tcaaatccaa gaacaagctc 180 

ttttgtaaac ccatgcaata ttttacccca 240 

agatactgga atgtaacctg catcagacag 300 

cagaccaagt tttggagtag agggagtagt 360 

ttaaagacaa tagtagaatc attatctctc 420 

ataagcaact acacag 4 66 



<220> 

<221> allele 
<222> 147 

<223> 99-25922-147 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 135.. 159 

<223> 99-25922-1 47. probe 

<220> 

<221> primer_bind 

<222> 128.. 146 

<223> 99-25922-147. mis 

<220> 

<221> prime rjoind 
<222> 148.. 166 

<223> 99-25922-147. mis complement 
<220> 

<221> primer_bind 
<222> 1. .21 
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<223> 99-25922. pu 
<220> 

<221> primerjbind 

<222> 451. .471 

<223> 99-25922. rp complement 



<400> 55 

ctctaatatc 

cactcacagg 

tg'atgagaat 

tagcagaact 

gccaagatat 

ctaaactatc 

acctttttct 

caaagtatat 



aataaagcct 
agaattttcc 
catacaaaat 
attaaaaaaa 
ccccaaatat 
aaaaatgaaa 
tatctctaca 
gtaactaaaa 



caaacatgtt 
atctacctga 
ggaattkatc 
atcaagtaca 
gctaaaaagt 
tgcaaattgt 
aaaaacatta 
actaaggaat 



tgctgattct 
aataatttaa 
tgaattcttg 
tttgtaatag 
cctctaattg 
agttagtagg 
caaataatat 
catttatagt 



aggacatatt 
atttatctta 
tgtttgcaga 
ttgaaataca 
tgatgctgaa 
gctaaaaaaa 
tttgtcgtat 
ttatcaggca 



gtcaatcacg 
tatttctctt 
gcacaacata 
aaagaagttg 
agaaactttt 
ttactaaatt 
gaagtgtgat 

g 



60 
120 
180 
240 
300 
360 
420 
471 



<210> 56 

<211> 499 

<212> DMA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 121 

<223> 99-25950-121 : polymorphic base G or C 
<220> 

<221> miscjbinding 

<222> 109. .133 

<223> 99-25950-121. probe 

<220> 

<221> primerjbind 

<222> 102. .120 

<223> 99-25950-121. mis 

<220> 

<221> prime r__bind 
<222> 122. .140 

<223> 99-25950-121. mis complement 



<220> 

<221> primerjbind 
<222> 1. .19 
<223> 99-25950. pu 



<220> 

<221> primerjbind 

<222> 479. .499 

<223> 99-25950. rp complement 



<400> 56 

ctaccaaatc 

accagaatta 

stgtagaaat 

ttgttgtgtt 

tattgtaagt 

attgggaaaa 

ttcaactctt 

aatttcccct 



atcaactacc 
tcacagagtc 
cttccttatg 
agaagtggct 
atcacatata 
agaatatagt 
ataggacaaa 
agatctcaga 



aaatacctca 
taaagttatt 
gattcaatat 
atctattgtt 
atccaggata 
ctgcaaatac 
ttcagaactc 
cagatgctga 



gtgcattttc 
tcacacttaa 
tttgggaaag 
actttgtaat 
ttttcctttg 
aaacattgaa 
aactaaatga 
gatctgtttt 



tagcattaac 
ttttgtagaa 
aagcctcagt 
aaattatctt 
gaagaggcaa 
aaatgtgacc 
aattctttat 
gtagcattca 



tttatttagt 
ataaatagat 
gatggatatt 
aaagggtgtt 
acccagttaa 
tctgcccctt 
attgttaaaa 
cccacgtacc 



60 
120 
180 
240 
300 
360 
420 
480 
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taaatctcac tctcagcaa 499 

<210> 57 

<211> 484 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 373 

<223> 99-25961-376 : polymorphic base A or C 
<220> 

<221> misc binding 

<222> 361.7385 

<223> 99-25961-376. probe 



<220> 

<221> primerjoind 

<222> 354. .372 

<223> 99-25961-376. mis 

<220> 

<221> primerjaind 
<222> 374. .392 

<223> 99-25961-376. mis complement 
<220> 

<221> primer__bind 
<222> 1. .20 
<223> 99-25961. pa 

<220> 

<221> primer_bind 
<222> 464. .484 

<223> 99-25961 .rp complement 
<220> 

<221> misc_feature 
<222> 395 

<223> n=a, g, c or t 



<400> 57 

tctcccctgt 

gttggaaaag 

caaggtctgg 

tccatgaaac 

aaatggkttc 

tcccagagct 

cctgggacgg 

acccttgaat 

ggct 



gaatgggaac 
gagggaaaga 
aagggtcact 
tatatgcagt 
cttgttgaat 
tagatttccg 
ggmtacacag 
gtggcagggg 



atgtatgaat 
cagacacacc 
ttctgccttc 
tggagcctga 
ttgtgagaag 
agaaaataaa 
acaacttata 
catggcaggt 



<210> 58 

<211> 470 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 399 



ttgaaattat 
cctggactca 
tttcccttcc 
tggtcttccc 
cctttcccca 
ctatgcagta 
tctcnaaact 
cactgtgaga 



gttctacatt 
gagggttctg 
attctttctt 
cttaccacct 
agtctcttgc 
ggaataggat 
ctggaagacg 
ttagcttctg 



ctcctccaca 
cctgcctcac 
caaataattg 
ccccaagtga 
tttgcttgaa 
tggtgtggct 
atcattgtaa 
ttaaactgag 



60 
120 
180 
240 
300 
360 
420 
480 
484 
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<223> 99-25965-399 polymorphic base A/or G 
<220> 

<221> misc_binding 

<222> 387.. 411 

<223> 99-25965-399. probe 

<220> 

<221> primerjaind 

<222> 380. .398 

<223> 99-25965-399. mis 

<220> 

<221> primer_bind 
<222> 400. .418 

<223> 99-25 965-3 99. mis complement 
<220> 

<221> primer_bind 
<222> 1. ,21 
<223> 99-25965. pu 

<220> 

<221> primer__bind 
<222> 451. . 470 

<223> 99-25965. rp complement 



<400> 58 

ctcaaaatac 

ggtttcgatt 

gcttttcagt 

ataatgatcc 

agaaagctga 

acatgaagtg 

aatgcgtact 

cttcctcatt 



atatacacca 
gaggatgctt 
gccagctcac 
aaatcagtaa 
ctcgattgta 
taaggagata 
ggtcacgtga 
aaaaaccaaa 



gctttaaatg 
agataagaaa 
ctttgtcaac 
ctttaaatga 
acgtttagct 
ataagttttg 
ctttcacaag 
ggtacaatgt 



ggaggccaag 
agcgcagagc 
ttcctaggtc 
tagaataaat 
actagagaag 
ctcattgtta 
gaaaatgcrt 
gggtactgtg 



ataacggaaa 
cccctttcct 
aaatgtcatg 
agagaaaaca 
atgaaagttc 
atttgtctca 
tattcaccac 
acttttagaa 



gaaagaatga 
gggtcaagtt 
ggaaaggaaa 
attggctgag 
aaagaactta 
tgattctgtc 
agcagatcat 



60 
120 
180 
24.0 
300 
360 
420 
470 



<210> 59 
<211> 523 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 241 

<223> 99-25966-241 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 229. .253 

<223> 99-25966-241. probe 



<220> 

<221> primer_ bind 

<222> 222.. 240 

<223> 99-25966-241. mis 



•<220> 

<221> primer_bind 
<222> 242. .260 

<223> 99-25966-241 .mis complement 
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<220> 

<221> primer_bind 
<222> 1. .21 
<223> 99-25966. pu 

<220> 

<221> primer_bind 
<222> 503.. 523 

<223> 99-25966. rp complement 
<400> 59 

caatagtgta aacagccaac cattaaaaac acataaaata agtttcataa agatgaaggt 60 

gacacaggag tccagactaa aawgtatgag tatgagttca ggaaaggctt tctaagaagg 120 

tattatttaa cttctgggca ttcatattta agtataggaa aagtcatgtc agtattcagg 180 

aaaaatctga gtcatgcttg atggtatgaa agtgtgaggc atgtattgag aatgttcaag 240 

rgtcggttga ctgccactcc ttcctccaag ctcatagcaa atcccacttc agcacggaac 300 

tgttcttaaa gtccacactt accgctcttg ctccctcatt ccccatgtgc cttgtttatt 3 60 

tagaggtaga tatattggtc aggtcaggca agctaagctt cagtaacaca tcaaccctta 420 

catttcagtg gcttcaccca gtgaagattt gt'ttctcact taggctccaa gggtatctta 4 80 

ggctggcagg agatcagctg cacatcatct tcactgactc tea 523 

<210> 60 

<211> 493 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 56 

<223> 99-25967-57 ; polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 44.* 68 

<223> 99-25967-57 .probe 

<220> 

<221> primer_bind 

<222> 37. .55 

<223> 99-25967-57. mis 

<220> 

<221> primerj>ind 
<222> 51. .15 

<223> 99-25967-57. mis complement 
<220> 

<221> primer jDind 
<222> 1..20 
<223> 99-25967. pu 

<220> 

<221> prime r__bind 
<222> 473. .493 

<223> 99-25967. rp complement 
<400> 60 

aacaagtgac atgacaaatg tacattttct ctatttggac tgttacacag agectrgatt 60 
gaaaaggacg aagagaagaa gcaggcacaa gagaacagct gggctattac tataattgga 120 
ggtgtgagaa agagaaggtc aagaactcct cccagttttc tggcaaaaac aagttaaaaa 180 
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cataaaaaaa 
gtcctagaag 
agaatggaga 
taagtgaaga 
tgattatact 
ggcattctga 



taaaaaaata 
gacaagctcg 
ttttgctgag 
acaaagttct 
agaaaagata 
ttg 



aattaaaagc 
tgagggatag 
ccataccgca 
gtgctagaga 
tatcaattac 



aaataaatgt 
gaaggtcaag 
cagatatcta 
taattattca 
ttttcataca 



gccaacgaca 
ggttttattt 
gtcagttgga 
agaaccattc 
aaacaaaaaa 



aagtcagaga 
cattgtatta 
aacataaatc 
ttaaacaaaa 
tgcgtatttt 



24 0 
300 
360 
420 
480 
493 



<210> 61 

<211> 530 

<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 200 

<223> 99-25969-200 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 188. .212 

<223> 99-25969-200. probe 

<220> 

<221> primerjbind 

<222> 181.. 199 

<223> 99-25969-200. mis 



<220> 

<221> primerjoind 
<222> 201., 219 

<223> 99-25969-200. mis complement 
<220> 

<221> primer__bind 
<222> 1. .21 
<223> 99-25969. pu 



<220> 

<221> primerjbind 

<222> 510. .530 

<223> 99-25969. rp complement 



<:400> 61 

gttagtgact 

aaactgccac 

taaaataatg 

aagtaaactg 

acagatgccc 

acccataaag 

ggaaggagag 

cagagcaaaa 

accaacagca 



ccttctttta 
caggagattt 
tacactgtga 
tgtagcatak 
catgaatcac 
tcaatatcaa 
aatgatcttc 
tcagagttca 
aagagtgaag 



cacatttctg 
agctaaggag 
catattcaag 
aaatcctacc 
ctagaaatcc 
gttaaaagca 
acatgtcaat 
attcagaagc 
tcaggagtgc 



gtttcaaagc 
taccagctga 
tcaagtacag 
acattctcca 
tgtcaacata 
ttttttttgt 
gaaaatctga 
cataaggaag 
agaaaagcaa 



agattatcaa 
ttttgttcta 
catttatatg 
aaccagacag 
taacaaatta 
cctcatgtaa 
gaaaagggag 
caggggaggg 
acaccgtagc 



gtaaaatttt 
atgcagtctt 
ctacacagtc 
agggagaatc 
tatgtggtag 
aataaactct 
gggtacagaa 
agccatgctt 



<210> 62 

<211> 448 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 315 



60 
120 
180 
240 
300 
360 
420 
480 
530 
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<223> 99-25972-3X7 : polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 303. .327 

<223> 99-25972-317 .probe 

<220> 

<221> primer_bind 

<222> 296.. 314 

<223> 99-25972-317. mis 

<220> 

<221> priraer__bind 
<222> 316.. 334 

<223> 99-25972-317. mis complement 
<220> 

<221> prime rjbind 
<222> 1..21 
<223> 99-25972. pu 

<220> 

<221> primexjbind 

<222> 428.. 448 

<223> 99-25972. rp complement 

<400> 62 

gacctatgag ttcattggtt cattccactt aatctagaga attcatttta cctagacaga 60 

cactaaaaaa gacacacagg tgcacacaca cttaggcata aattgtcagt ttttagccca 120 . 

ggtctgtcct attcccatta ttacatataa ttggattaga aatgtcattt taaaattatt 180. 

aagctatagg taaggccttt ctttgctaca tatatatata atcttgactt ctttaaaatc 240 . 

tagcctagaa aacatttttg tttctttttc tctttttcct ggtcccttct ttagctccca 300 

agaatttcag aaaaytgttg cattttagat tattttgcca gctgcatact tttctgtcta 360 

atttgcagat gtcacttact tgataacact tcacacagct gtctaaccca agcttaaatg 420 . 

attttctgat ttctgaagaa gtggctac 448 

<210> 63 

<211> 544 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 143 

<223> 99-25974-143 : polymorphic base A or G 
<220> 

<221> miscjoinding 

<222> 131.. 155 

<223> 99-25974-143. probe 

<2W> 

<221> primerjoind 

<222> 124. .142 

<223> 99-25974-143. mis 

<220> 

<221> primer_bind 
<222> 144.. 162 

<223> 99-25974-143. mis complement 



173 



WO 01/40493 



PCT/IB00/01507 



<220> 

<22l> primer_bind 
<222> 1..21 
<223> 99-25974. pu 



<220> 

<221> primer_bind 
<222> 524. .544 

<223> 99-2597 4. rp complement 



<400> 63 

gctataatct ctccttcctt ggtagccttg gggttgttta ttttcttagg gttttaagtt 60 

atttttcttg gcctgaaaac tcatggtgcc tactgtcaga gctgcacaga acccagtgct 120 

cagccactgc gtgcctccaa gcrctgacat cctaacaaca gtctgggtct ctgaggtctt 180 

ggagcagagg gaggtgagac aaagataaca gggtctaaac tgtgcctact ggaagtctct 24 0 

gctctcaaat aataaaaaaa cttaaaatga acaacatttt actaaatata gcctgatcac 300 

tgcatttgca tgggggctgg attgatctgg aatgatcaga gctaacatga agctcttttc 360 

tggagatgtg ggacatcaaa gtcacatact aagaagggat gaagacattg cagtgtatcg 420 

gtctccctca ccctgtcctt . ctctagcggt accccctctc tgaatctttt cttactggcc 480 

acaaagaaca aaactctcag tgcaatgcca tgacaaattt ctccaggaag sytaaccaaa 540 

tcgt 544 

<210> 64 

<2ll> 540 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 311 

<223> 99-25977-311 : polymorphic base A or G 
<220> 

<221> misc_binding 

<222> 299. .323 

<223> 99-25977-311. probe 

<220> 

<221> primer_bind 

<222> 292.. 310 

<223> 99-25977-311, mis 



<220> 

<221> prime r_bind 
<222> 312. .330 

<223> 99-25977-311. mis complement 
<220> 

<221> primer_bind 
<222> 1..21 
<223> 99-25977. pu 

<220> 

<221> primer__bind 
<222> 520. .540 

<223> 99-25977. rp complement 
<400> 64 

caattctctc aaaattgtct caaataaaca agtattgagc catcgtgatg aattatgttt 60 
gttcacaggt tattccattt aattttctct tcccacctca ttaagatgta aatagtaact 120 
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cagcttcata gagaagactg 
aaaggcagaa gctgctcttc 
ttaggaaagc aacacttaag 
caagcccatg rttctgtgtt 
cggtaacgga ggtggaagtc 
aatattctgt tgcaccctgg 
agcaaaagaa ccatttaaaa 



aggctaagag aaaaggattg tccaggctga aaggttagaa 
tactcagatt tctgacttca attccacaaa ggtcttaaaa 
cactttaaat agctatgaaa aatgcaaaat ggaatctatt 
aattgtgctc tgatgattgt tctctttctt tgcagcccca 
agttctactt ctctgttgtt ttgcaaggaa tgttcctagg 
gaacattatc ttcatccgta ttttaagctt tttatagaca 
tatatatacc acagcttctg atacagtgtt gaaatactac 



180 
240 
300 
360 
420 
480 
540 



<210> 65 

<211> 538 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 93 

<223> 99-25979-93 : polymorphic base A or G . 
<220> 

<221> misojbinding 

<222> 81. . 105 

<223> 99-25979-93 -probe 

<220> 

<221> primer_bind 

<222> 74. .92 

<223> 99-25979-93. mis 

<220> 

<221> primerjaind 
<222> 94. .112 

<223> 99-25979-93. mis complement 
<220> 

<221> prime r_bind 
<222> 1..19 
<223> 99-25979. pu 

<220> 

<221> primer_bind 

<222> 518. .538 

<223> 99-25979. rp complement 

<220> 

<221> mis cofeature 

<222> 412,462 

<223> n=a, g, c or t 

<400> 65 

cactacagta atcggagaca atgcagtgac aggattgact acccgtggat cagcctgaag 60 

cttcaccttt tttttcaaag tgcattttga gartatttgt gctatccatt agtaaaaatt 120 

accaaagagg agcataatgg aagcatagag aaatagtaaa aatagaaaca tccagatatt 180 

ctatgaaccc taggaaattt acacaacagt attattattc aatatattgt tgtcgtgtgg 240 

caccaacgcc agagaagaat taattgctga gcaacggccc agacaggagt tcccaagaag 300 

taggttcagg tccatgagtt gggaacttgc tgagttgctg tggatctttg cttctttaaa 360 

gtgttgtcca tggattgcgg gcatcatcag tattacctgg gatcttcgta gnaatcacag 420 

aatcttggtc cctaacccag atctgcagaa tcagttactg cnggcttcac atgataccca 480 

ctggatgtgc atgtacattc aagtttgaaa tgatggtgtg aatcaggcag cagaaatg 538 



<210> 66 
<211> 469 
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<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 194 

<223> 99-25985-194 : polymorphic base C or T 
<220> 

<221> miscjoinding 

<222> 182. .206 

<223> 99-25985-194. probe 

<220> 

<221> primer Joind 

<222> 175. .193 

<223> 99-25985-194. mis 

<220> 

<221> primer_bind 
<222> 195.. 213 

<223> 99-25985-194. mis complement 
<220> 

<221> primer bind 
<222> 1..21 " 
<223> 99-25985. pu 

<220> 

<221> primer bind 
<222> 449. .469 

<223> 99-25985. rp complement 
<400> 66 



PCT/1B00/01507 



SJJSSS ££££ SSSIK 2ii^2 cta T ataa t9tttggaca 60 

aagtaaacac tttgtcac« tg?J£g?£ t a at ^ aattcataat 120 

tcagcataat atayattatt acattaacga caalaaaS ta * cctactt aggtattatt 180 

aagtatagac tgaactagat aagacaagfg aacattttS taS*^ atCt ^ tacaa 240 

tctgcacgtc actttaagac atggctgaac tatccL^ tagatttgaa ctcagagagc 300 

•tcctccctg tgaaact?aa aacata?taa tctctaalaa IrT ft** 9 * ttatt ^ fc 360 

ttttcatcct gagctaatgt atgotgtgct gaaaaStta Scccatt* \f 



<210> 67 
<211> 553 
<212> DNA 

<213> Homo sapiens 
<220> 

<221> allele 
<222> 398 

<223> 99-25989-398 : polymorphic base A or G 
<220> 

<221> misc__binding 

<222> 386. .410 

<223> 99-25989-398. probe 

<220> 

<221> primer_bind 
<222> 379.. 397 
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<223> 99-25989-398.mis 
<220> 

<22l> primer__bind 
<222> 399.. 417 

<223> 99-25989-398. mis complement 
<220> 

<221> primerjbind 
<222> 1. .19 
<223> 99-25989. pu 

<220> 

<221> primer_bind 

<222> 533. .553 

<223> 99-25989. rp complement 



<4O0> 67 

cccatcttat 

aattagtatt 

gaagtgcaat 

gtacatatgq 

ctccgacccc 

tgtattccaa 

taactcaaat 

gactttgcta 

aaatttctag 

aaaacaactg 



cttcaaatcc 
cttagataat 
gccactgagt 
ttaacaataa 
cacatgtaaa 
gacagtaatt 
aacagcgata 
accatactca 
aggtaaggce 
tga 



ccatgttaca 
gcagacattt 
ttaccctgaa 
aaggtaaaat 
ctaattccag 
cttgagagct 
attataataa 
agcatactg-t 
ctggcctcat 



ctataggtgc 
aaaaagcaca 
atcatgaaaa 
ctctcaccat 
ggtaattgca 
atttgcatca 
tcattacrta 
ctttccaaga 
tcagattatt 



agtaagtata 
agtcattttt 
tggcttgatg 
cacttttggt 
gggtgatagt 
caactactat 
tttactgagt 
tacaaatgtt 
tcagttcata 



aaacaaacct 
ggaaaaegaa 
gagtgctaat 
agtcatttct 
tttgattttc 
cagtggctac 
attcactatg 
ttagaaaatg 
ttgggattkr 



60 
120 
180 
240 
300 
360 
420 
480 
540 
553 



<210> 63 

<211> 451 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 264 

<223> 99-26146-264 : polymorphic base G or T 
<220> 

<221> misc_binding 

<222> 252.. 276 

<223> 99-26146-2 64 .probe 



<220> 

<221> prime rjbind 

<222> 245.-263 

<223> 99-26146-264 .mis 

<220> 

<221> prime r_bind 
<222> 265. .283 

<223> 99-26146-264 .mis complement 
<220> 

<221> primerjDind 
<222> 1..19 
<223> 99-26146. pu 



<220> 

<221> primer_bind 
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<222> 436*. 456 

<223> 9 9- 2 61 4 6. rp complement 



<400> 68 

ttttacaccc 

gtttaagtga 

ttgggatttg 

atgaaacaaa 

tcttggctac 

ttacttcctt 

accttagtaa 

accatctgta 



atcaaaacct 
atcaattaag 
ttactagtat 
tctgagccaa 
attatttaat 
ataattacac 
acctatttta 
gttyyttgca 



aaaaatcctfc 
aggaaatgaa 
gtgtattttt 
aaagcaggtt 
atcktttaca 
atcataaatt 
atatcaaaaa 
tgaaaaatag 



catacatttt 
tactcgactt 
attatgacac 
ttgaaatcaa 
tctacagatg 
caaactattg 
aatcccaaat 

g 



aatatctacc 
atteagaccg 
aactaatgca 
cttcacttat 
cttatcacct 
taagttgaaa 
tgaactatct 



caggaaatga 
aagttgttat 
gtgatgtgtt 
aaaggcatta 
taagatggga 
tgcatttagt 
taagttgggg 



60 
120 
180 
240 
300 
360 
420 
451 



<210> 69 

<211> 454 

<212> DMA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 276 

<223> 99-26150-27 6 ; polymorphic base A or G 
<220> 

<221> iuisc_binding 

<222> 264.. 288 

<223> 99-26150-27 6. probe 

<220> 

<221> prime r_bind 

<222> 257. .275 

<223> 99-26150-276. mis 

<220> 

<221> prime r_bind 
<222> 277.. 295 

<223> 99-26150-276. rais complement 
<220> 

<221> primer_bind 
<222> 1..19 
<223> 99-26150, pu 



<220> 

<221> primer_bind 
<222> 437. .454 

<223> 99-26150, rp complement 



<400> 69 

gaaagattag 

aagggatcca 

ga caeca get 

ctcagtttga 

tgtggtcatt 

catactaact 

tgtcagtttg 

agatgeaaag 

<210> 70 
<211> 449 
<212> DNA 



ttgeatgaga 
agaacagtta 
tgttggagga 
ggaaggecta 
tecaeggaga 
tttcatattt 
atacattgtt 
aaatatggta 



taatctgatc 
ctgeaatatt 
gaatgaattg 
ttgagacatt 
tggagataaa 
tgaggtttct 
tttctctgtt 
cttttggtgg 



tctaaaattc 
cgtaccacaa 
egageggaga 
tggttcatgt 
tgagcrtatt 
aaagttgaac 
aaaaacatat 
caca 



tgecttttta 


atattttatc 


60 


teaactttge 


aatttgaaaa 


120 


gagtcagaga 


ggagataaga 


180 


gacaatctgg 


tgttttgtac 


240 


ttggacagtt 


tatttaacag 


300 


gcatcttcta 


attgatgaca 


360 


gctatgatac 


atggcatctg 


420 






454 
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<213> Homo sapiens 
<220> 

<221> allele 
<222> 44 

<223> 99-26153-44 : polymorphic base A pr C 
<220> 

<221> miscjDinding 

<222> 32. .56 

<223> 99-26153-4 4. probe 



<220> 

<221> primer_bind 

<222> 25. .43 

<223> 99-26153-44. mis 



<220> 

<221> primer_Joind 
<222> 45. .63 

<223> 99-26153-44. mis complement 
<220> 

<221> primer Jbind 
<222> 1. .19 
<223> 99-26153. pu 



<220> 

<221> primer_bind 
<222> 429. .449 

<223> 99-26153. rp complement 



<400> 70 

atgcttaatg 

tcagggtata 

atagtgtaca 

gcactgaact 

ataatatttt 

gtaatttctc 

aaacagagaa 

catgagatgg 



cacatctacr 
aaatctttgc 
gcaattaact 
tcgctggtca 
tatatgtcaa 
aaaaacaaag 
tgagggaaga 
tataaagagg 



taatctatat 
atactacgtt 
gctgtcaagt 
ataagtgcat 
ttgctcctct 
aaacggaagg. 
agaaaggaag 
caggtaatg 



aaatcaatca 
aacatagtat 
tttatctttc 
tatatacaca 
agattgagtc 
aaagaaggaa 
tgctgaaaat 



atcmatctat 
acaagttcct 
ctgtaaaact 
cccatatgca 
caatgctcag 
gggggatgga 
gtatttgggg 



gtctacctgt 
tgtgcaacag 
cccagcacct 
catacatwat 
aattataacg 
gagaagtcag 
tgaataaaca 



60 
120 
180 
240 
300 
360 
420 
449 



<210> 71 
<211> 511 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 290 

<223> 99-26156-290 : polymorphic base A or C 
<220> 

<221> misc_binding 

<222> 275. .302 

<223> 99-26156-290. probe 

<220> 

<221> primer_bind 

<222> 271. .289 

<223> 99-26156-290. mis 
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<220> 

<221> primer_bind 
<222> 291. .309 

<223> 99-26156-290 .mis complement 
<220> 

<221> primerjoind 
<222> 1. .18 
<223> 99-26156. pu 

<220> 

<221> prime r_bind 
<222> 491.. 511 

<223> 99-26156. rp complement 
<400> 71 

gatgagtgtg aggtagtgtc tcattttggt tttgagtttt cctaatgatt actgacattg 60 

aaaatctttt tataatgtct atatttataa aattatattt atatagacaa agacccccaa 120 

aaacatattt gtctagggcc ttgtagaccc aaggggaagc cctgatggat tcgaatccta 180 

attccaccac ctattagctg -tgtagtgttg aatatatctg taaacctact tgagtatcag 24 0 

aatttttata ttggctgcat tttaactgta gatagctgtt ctggagcaam aatactgtaa 300 

aattagtctt ttcttgaaaa- acaggcagaa cctagtattg acagatataa ctattttttg 360 

cacccctctt ctatatctta tgctattgtc cacgagctta ttatattgta acttgatatt 4 20 

aattttaaga aagatttgaa attagccata ttttagtatg tcagttttcc ataacaagga 480 

aaaaaaaaca cagaataaag atttaagcca c 511 

<210> 72 

<211> 551 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 189 

<223> 99-27345-189 : polymorphic base G or C 
<220> 

<221> misc_binding 

<222> 177 ..201 

<223> 99-27345-189. probe 

<220> 

<221> primer^bind 

<222> 170.. 188 

<223> 99-27345-189. mis 

<220> 

<221> primer_bind 
<222> 190.. 208 

<223> 99-2734 5-189. mis complement 
<220> 

<221> primer_bind 
<222> 1. .18 
<223> 99-27345. pu 

<220> 

<221> primer_bind 
<222> 531. .551 

<223> 99-27345. rp complement 
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<400> 72 

taacccacac ggatttagct agagttctag ttagctcatg agtagggaca gaactgggtt 60 

tggagttaat tttactctgt acttatgact taaaaccgga ttggaattac cttctgagta 120 

agtgttccca tattaccata tctaaattag gtgtttcatt ttttataact acggtagcat 180 

cttctcagsa tcaactatat aatttttagg gcccagtgaa aactgaaact atgatgtcct 240 

tcactagaaa atgattcaga attttcagga ggtgccagca gagtgttagt ccaaaagcag 300 

gtccttctac aggacccaga atgatggcac agcctgggaa acaataaagc tggccctgcc 360. 

cccttcctca gtatttatca gaacttgtag ttgtaatttt tataaaatac gtacggtaat 420 

ttttctgttg catttgcttg ttttctggaa aactgcaagc ttcacaacgt gagtgacatg 480 

catactgtat ttcttgggtc agctcagcat ctagcacagt gcctgatgca gcagmcaccg 540 

ataaattctg t 551 

<210> 73 

<211> 514 . 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 380 

<223> 99-30329-380 ; polymorphic base C or T 
<220> 

<221> miscjbinding 

<222> 368.. 392 

<223> 99-30329-380. probe 

<220> 

<221> primerjbind 

<222> 361.. 379 

<223> 99-30329-380.mis 

<220> 

<221> prime r__bind 
<222> 381. .399 

<223> 99-30329-380. mis complement 
<220> 

<221> primerjbind 
<222> 1. .18 
<223> 99-30329. pu 

<220> 

<221> prime r_bind 
<222> 496. .514 

<223> 99-30329. rp complement 
<400> 73 

gacatacccc aaacacagca gcacaaagca actgcctgtg attgtcaatt atctgcacta 60 
actttgaata gacaactgag agtaaaattg accattagct gttattaatc ttttaaatag 120 
cttatttatc aggacaatat tttctttgag gaatggtttc acatcgttga taaacagttt 180 
cctattacta acattaaact aaataatata tgtcatattt cttttttacc cttagaaagt 240 
taatctaata atattattct gattagtggt acttttttta atttttgtaa tgttgtttat 300 
ggtgtttgat ttgtactttt ttgtcttttt catacctttt aataacatcc gatttactac 360 
tggagcattt tattttcaty gtctcttaaa atacttaata acttttagat ttgttgttca 420 
caaaactaaa taattgcttg atgtttcatt ttatccttct tattaaatag gcagctcgtg 4 80 
tacatatgaa agtgtctttt tatacaccga aacc 514 

<210> 74 
<211> 511 
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<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 342 

<223> 99-31798-344 : variable motif CAT or ACTGTCTCTACCTCA 
<220> 

<221> primer_bind 
<222> 1. .20 
<223> 99-31798. pu 

<220> . 

<221> primer_bind 
<222> 491. .511 
<223> 99-31798. rp complement 

<400> 74 

caacccataa caaatggaag gcagagccca 
tccccacagc ctagaagttg gggagagagg 
catagaggaa ggtaagaagg taagatacct 
aggaaaattg .catgatctca tactaggtga 
tgcgagctga* tctaagccaa ctccaacatt 
acttgtaggt aagggctaca tatattcttc 
gcatttatat ccttattata gcagaggfcag 
cctcccaaag ggtaaacatt attagcaagc 
tttttaagtg cagaaatatc tccactccgt 

<210> 75 
<211> 474 
<212> DNA 
<213> Homo sapiens 

<220> 

<221> allele 
<222> 159 

<223> 99-5873-159 : polymorphic base C or T 
<220> 

<221> miscjt>inding 
<222> 147* .171 
<223> 99-5873-159. probe 

<220> 

<221> primer^binci 
<222> 140. .158 
<223> 99-5873-159. mis 

<220> 

<221> pr inter Jbind 
<222> 160. .178 

<223> 99-5873-159. mis complement 
<220> 

<221> prime rjbind 
<222> 11. .28 
<223> 99-5873. pu 

<220> 

<221> primer Jbind 



atggcaaagg tagtgccaag 
aggcctaaaa tgcttccaga 
taagattcgg gcatggggaa 
cctcagtacc agtgaagttt 
caacaatggt aagggaaccc 
attacaacct gactgtctct 
ttatgattaa agtaactatc 
aggaaggaaa caagatatgt 
c 



gcagatgtgt 


60 


ctgggtctac 


120 


ggggcagaga 


180 


tgattggatt 


240 


ttagraattc 


300 


acctcattat 


360 


agactgaaat 


420 


ctaggagaga 


480 




511 
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<222> 466. .474 

<223> 99-5873 .rp complement 



<4 00> 75 

aaaggagcag 

actaaaaaag 

ttaaaaaaga 

acatcctttg 

cagattagat 

ctagtgtaac 

attaatatca 

taatgaataa 



gcgtaacaat 
tctaatgaaa 
aaaaaaatag 
aaagaacaaa 
attgttaaaa 
catcaaagat 
taaaaatata 
taatatttaa 



aagcagggtt 
tcagtagtct 
gtatgaaaga 
gagttattca 
agaggaatag 
gtattgcaaa 
ctttaagaag 
aaaaaagtat 



agtcgccaca 
taaaaagatg 
gagtaacaya 
aattgaattc 
ggaattaaat 
tgataaagaa 
taaataaatg 
aacaagtata 



aaacttgaga 
acatgatagg 
taccggaaaa 
ttaatgaatt 
gatatatgtg 
aaaaatgctg 
caactaggaa 
cataagatgt 



taagaggaaa 
. aagagaagtg 
gggataaaat 
acttaaacag 
atgatattac 
ccatggcaat 
tagagaaagc 
aata 



<210> 76 

<211> 490 

<212> DNA 

<213> Homo sapiens 

<220> 

<221> allele 
<222> 220 

<223> 99-6012-220 ; 



polymorphic base G or T 



<220> 

<221> misc__binding 

<222> 208.. 232 

<223> 99-6012-220. probe 

<220> 

<221> primer_bind 

<222> 201. .219 

<223> 99-6012-220. mis 



60 
120 
180 
240 
300 
360 
420 
474 



<220> 

<221> prxmer__bind 
<222> 221. .239 

<223> 99-6012-220. mis complement 
<220> 

<221> primer_bind 
<222> 11.. 29 
<223> 99-6012. pu 

<220> 

<221> primer_bind 

<222> 477.. 490 

<223> 99-6012. rp complement 



<400> 76 

attgtattta 

aaatatacac 

cagtgagctg 

aaggtgagga 

acatttctag 

atatatctgc 

aaaggcacca 

tttggctgtc 

tatacaccca 



gtcttgactt 
gtcttctcag 
ctgctcccag 
atttcctgat 
gcataaaaaa 
caacaaagac 
tacttagtgt 
tttagttgtt 



gttttcctga 
ttttaataat 
gtcagtctgc 
tttagagatt 
atgtacagca 
cacatggagt 
tttatttcca 
tctaaagtag 



gggtccaggt 
tgactgacag 
aaaggatgct 
tctttatcck 
ctctactgct 
atttcattga 
tgagtgaagg 
tgctgatttt 



tgatttgcat 
ccctgtggtt 
ggttcccttg 
aattttgaag 
tgtttaacaa 
ctatcagaga 
aaaattagtt 
atatgcccat 



gctcttgagg 
tctcaggacc 
tggtctcatc 
actttctttc 
atggatagtg 
agtttcctcg 
atttgaagta 
aatattcata 



60 
120 
180 
240 
300 
360 
420 
480 
490 



<210> 77 
<211> 529 
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<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 99 

<223> 99-6080-99 : polymorphic base C or T 
<220> 

<221> miscjainding 

<222> 87. .111 

<223> 99-6080-99. probe 



<220> 

<221> prime r_bind 

<222> 80. .98 

<223> 99-6080*99. mis 

<220> 

<221> primerjbind 
<222> 100. .118 

<223> 99-608 0-99. mis complement 
<220> 

<221> prime rjbind 
<222> 11.. 28 
<223> 99-6080. pu 

<220> 

<221> primer_bind 

<222> 519. .529 

<223> 99-6080. rp complement 



<400> 77 

acaacaacaa 

ttgattacaa 

gcaaaacaat- 

gtttatgcta 

ctttatatga 

tctattacaa 

ttgtcacata 

caaaaagtga 

ttaattactc 



aaatgtgtcc 
agccatctct 
agctcgtgtt 
tttcctataa 
aaaaattatt 
gtttattttt 
ttaaaacatg 
gttggccaga 
aactccacca 



ctgaaaccca 
agcaatttaa 
caaggtatgt 
actagattct 
gaaaaatcta 
aatttcaaaa 
tacattattg 
tagtaaataa 
ctattttgcc 



tgctatattc 
tacaattayg 
ctttattagg 
aaaatatttt 
caagatggtg 
atgtactgca 
tgtgcaaatt 
tttaggctct 
aaagcagtca 



aactgaatat 
aaatggaaaa 
ggaagtttat 
attctataaa 
aaactcttta 
taaatgcagc 
aaaatttcat 
aaggctgaaa 
cagacaata 



tctaatgtct 
gttggcaaat 
cgaaacagat 
gatgtattga 
gaactatatt 
aaaaccttta 
taccttaaac 
agcgcttgta 



60 
120 
180 
240 
300 
360 
420 
480 
529 



<210> 78 
<211> 489 
<212> DNA 

<213> Homo sapiens 



<220> 

<221> allele 
<222> 156 

<223> 99-7308-157 t polymorphic base C or T 
<220> 

<221> misc_binding 

<222> 144. . 168 

<223> 99-7308-157. probe 



<220> 

<221> prime r_bind 
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<222> 137. .155 

<223> 99-7308-157 .mis 



<220> 

<221> primer_bind 
<222> 157.. 175 

<223> 99-7308-157 .mis complement 
<220> 

<221> primer__bind 
<222> 1..18 
<223> 99-7308. pu 



<22G> 

<221> primerjbind 

<222> 469. .489 

<223> 99-7308. rp complement 



<400> 78 

tgtggtctgg 

ttgcattatt 

ggtatatgta 

ttttaagtaa* 

ctaaacactg 

ctgtcccata 

ttttctttcc 

ccaatttcca 

ggcaactat 



atatggtgra 
cctgaggggc 
ttcatggtga 
ttttcctgaa 
gcatatgtgt 
tttttgtgta 
tgataaataa 
attctttttt 



ctgtccttca 
aatgcattcc 
caaatttatt 
tgtgcctcat 
tcagatttca 
aagttagaca 
caattcacat 
caccatgtat 



cacacagatg 
agttacatag 
cacatyttaa 
tggcttctgt 
acattccgtt 
ataaagtgtt 
atctttttaa 
ctattggagt 



tgggaagcca 
aaccagtttc 
gtaattttaa 
gcctcttcag 
gttttcattg 
aatattctgg 
aatatcagag 
tttaaaatga 



tgatcatcag 
tacgtttcag 
gtaattcaca 
aaaagatgaa 
tggataattt 
cgtcggcaca 
aatatagtaa 
eta at act a a 



60 
120 
180 
240 
300 
360 
420 
480 
489 



<210> 7 9 
<211> 112 
<212> DNA 

<213> Pan troglodytes 
<400> 79 

tcccagtggg attcaaccac tcatagactg cttttgttca aactgegtte aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctcg cttctaattc ct 112 

<210> 80 
<211> 118 
<212> DNA 

<213> Pan troglodytes 
<400> 80 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga attataaaga 60 
accatctcag aattttactc tgttactgca aaatctcatg caaagataac caagtatg 118 



<210> 81 
<211> 146 
<212> DNA 
<213> Pan 



troglodytes 



<400> 81 

ttgeaggtec acattegtte tagaggctgc ctgcggtcct tagaggecac tcatagttcc 60 

ttgccatgaa gacttttcca acatggctgc ttacttctta aagcctgtaa ggagatccag 120 

tctcagttgg ccaagtcttg taaaag 146 



<210> 82 
<211> 94 
<212> DNA 

<213> Pan troglodytes 
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<400> 82 

gaaaataacg catttataaa acccagctca gatgttactt ccaaccaaca aatttcctgg 60 
acacactaag taatagagtt aatcacactg tcct 94 



<210> 83 
<211> 229 
<212> DNA 

<213> Pan troglodytes 



<400> 83 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 60 

ttgccatgaa gacttttcca acatggctgc ttacttctta aagcctgtaa ggagatccag 120 

tctcagttgg ccaagtcttg taaaaggtaa cctcatcacc ggagtcacgt ctcatcacct 180 

ttcccatgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 84 
<211> 205 
<212> DNA 

<213> Pan troglodytes 



<400> 84 

tgaaggtctc cctgctgtgc ttagagaggg ggatgctgga agaagagtgg tcagagaagc' 60 

aacattgctg gctctaaaga cagaggacag gccacgagcc aaggcatgca ggaggcctcc 120 

agaagctggg gcaggcaaga gaatggactc agccctggag cctccagaag gcacgcagcc 180 

ctgcccattc cttgctttta gccag 205 

<210> 85 
<211> 82 
<212> DNA 

<213> Pan troglodytes 
<400> 85 

aaacaatacg aacagaaaca aagctatgta cttgttttgg gaagcaacct caaatggaaa 60 
gatgggggct gtgttctaaa ag 82 

<210> 86 
<211> 234 
<212> DNA 

<213> Pan troglodytes 
<400> 86 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 
atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 
cttttccagc attttgggtc ctcccagcta aatcatcggc attgtgaagc agagatgaga 180 
catcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac acta 234 

<210> 87 

<211> 714 

<212> DNA 

<213> Pan troglodytes 



<400> 87 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 

atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc attttgggtc ctcccagcta aatcatcggc attgtgaagc agagatgaga 180 

catcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac actagtgagc 240 

ttttacttta agccactaat ttggggtgta gctcttttgc agcaaaagat aatcagaaca 300 

ctgccatgca ggagctgaga aacttgctca ctgttatgcg tgtatccaat gactactcac 360 

acacacatta cttgcttcac tctgctttct ggctttccat tcagcaaatt ttaagaaatg 420 

aaaaagttat tcatgggaaa ttaactcaag atgaatattg ttgtagttaa tctctttgta 480 
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aatggtaaaa tattaaatta ttgtgagtca tttttattta aagaagtttg gttctttaaa 540 

tgaaaaaagt ttgggtcttt tgtggatact ttgcatatag acattctaga catgaaagaa 600 

tgttgaagac aaagaatgtt gaaggaatat agacatgaaa gaatgttgaa gtttggtttt 660 

ttgtggatac tctgcatata gacattctag actggagtga tacatcttgt tcag 714 

<210> 88 
<211> 220 
<212> DNA 

<213> Pan troglodytes 
<400> 88 

acatgaaaga atgttgaagt ttggtttttt gtggatactc tgcatataga cattctagac 60 
tggagtgata catcttgttc aggtttctat gttttcaaaa tgataacttg acacacctac 120 
caatgccaat ttttaggata tatatctgta tatctgtgtg aactaaggaa aattcaggta 180 
tgattaaacc tacatatttt gccaggagag actggataat 220 

<210> 89 

<211> 173 

<212> DNA 

<213> Pan troglodytes 
<220> 

<221> misc_feature 

<222> 91.. 92, 131 

<223> n=a, g, c or t 

<4O0> 89 

ggtcctgatc ttcccagtga cttagcagca cagtaatttg gctgaaatgt tgatgagttc 60 

agacaagaag aaggtgatga tgacaagaga nncgtcacgg ggaaattctg ccctgaggaa 120 

ggaatagttt naggctggag agatgtgcta cccccgccac tctttctcaa atg 173 

<210> 90 
<211> 1105 
<212> DNA 

<213> Pan troglodytes 
<220> 

<221> mis cofeature 

<222> 201,744,803,808.-809 

<223> n«a, g, c or t 

<4O0> 90 

gaaagacaac ctgtaggcac agaaatcatt gctctgctta cactgaaatc tcctgctgac 60 

tttgagagct gagcactgga ttttgatggg agagtgaaac ttagtgaagg aatgatttga 120 

acctttgttg atctaatact tctgtgatac agttatctaa acagaatgag tctaaagtca 180 

gtataaagac ctgtgtctct ncatgttttg ctacttataa ttgcagagga acattttgag 240 

atagaaaatt ttatccccct ttcaagtgac tttttataaa acataacctt ttaagagagg 300 

tcattctttt gaaaaattgt ctgaatacaa gagaagtaac tgaacagcaa gggattgtag 360 

ccataatgtt cttgaccact ttagattgac aaatagtgtc tcaaaattga tcaggttctg 420 

gagaatctat tgaggttgta tggagtaacc aagtgatgga agaggttgga agtgaataat 4 80 

tttatgtggt attatgtgat tttttttggt gggggtagtg tctggagctg agcatgatgg 54 0 

ggcctagatg gggcagcctt ctaatgctcc cagtatttcg caaatcatat gtcacgtcaa 600 

aattgtaatg acaaagctca cagcagctgc ctacgccagg tttggtgcca gctctaatta 660 

gagcatcgat catttctctc cccttcatga atgcacgtgc gtgcacattt tcatatacaa 720 

gggcaaagga catttcaaag tttnggatgg aatctttcct acttagaagt aataaaaagc 780 

aatactttga aatcttacca aanattannc aaatagcaac aatttttcca tttaaagatg 840 

tgtgtcttca ggtctgttgg atctttcaaa tttctatttt ggacatggaa actacagatg 900 

aaatttaact tactcaaagt tacacagaac tcctggctga tctcttttac ttaattcttc 960 

taaaatttta agtaagaata attttgaaat gttattggta atgcttaaat aacaaaagtt 1020 

actttaatat gttacatgtt ggaagtttca tttatactta atatctaatt taactcttgt 1080 

gaatatatta ttcagttttt ttttt 1105 
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<210> 91 
<211> 114 
<212> DNA 

<213> Hylobates sp. 
<220> 

<221> misc_feature 
<222> 45,75 

<223> n=a, g, c or t . • 

<400> 91 

tcccagtgag attcaaccac tcatagactg cttttgtcca aactngcgtt caaataaggc 60 
aaacgccgag ctggnaaccc atctcactgt ttctgtacct cacttctgat tcct 114 

<210> 92 
<211> 117 
<212> DNA 

<213> Hylobates sp* 



60 
117 



<400> 92 

ttttagaaaa caaatctcct aagagaatga atagaaatta tcttaaggaa ttataaagaa 
ccatctcaga attttactct gttactgcaa aatctcatgc aaagatagcc aagtatg 

<210> 93 
<211> 146 
<212> DNA 

<213> Hylobates sp. 
<400> 93 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggccac tgatacttcc 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaagtcttg tgtaag 146 



<210> 94 
<211> 229 
<212> DNA 

<213> Hylobates sp. 



<400> 94 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggccac tgatacttcc 60 

ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 

tctcagttgg ccaagtcttg tgtaaggtaa cctcatcacc ggagtcacgt ctcatcacct 180 

ttcccaagtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 95 
<211> 205 
<212> DNA 

<213> Hylobates sp. 
<220> 

<221> m.isc_feature 
<222> 187 

<223> n=a, g, c or t 
<400> 95 

tgaagttctc cctgctgtgc tcagagaggg gggtgctgga agaagagtgg tcagagaagc 
aacattgctg gctctaaaga gagaggacag gccacgagcc aaggcatgca ggaggcctcc 
agaagctggg gcaggcaaga gaatggactc ggccctgcag cctccagaag gcatgcagcc 
ctgcccnttc cttgctttta gccag 



60 
120 
180 
205 
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<210> 96 
<211> 234 
<212> DNA 

<213> Hylobates sp. 



<400> 96 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cagattgcac tgaggccacc 60 

atgctgtaag aaacctcaag ctaccaacct gaaaagctgg agagaatcag cacccatcag 120 

cttttccagc cttttgggtc ctcccggcta aatcataggc attgtgaagc agagatgaga 180 

cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 97 
<211> 715 
<212> DNA 

<213> Hylobates sp. 



<400> 97 

gaacattcac 

atgctgtaag 

cttttccagc 

cctcctgcca 

ttttacttta 

ctgccatgca 

acacacattt 

aaaatgttat 

aatggtaaaa 

tgaaagaagt 

tgttgaagac 

tttgtggata 



tctgggagct 
aaacctcaag 
cttttgggtc 
tgccctttcc 
agccactaat 
ggagctgata 
cttgcttcac 
tcatgggaaa 
tattaaatta 
ttgggtcttt 
aaagaatgtg 
ctctgcatat 



ctgtgctgcc 
ctaccaacct 
ctcccggcta 
aaatcctgga 
ttggggtgta 
aacttgctta 
tctactttct 
ttaactcaag 
ttgctaatca 
tgtggatact 
gaaggcatat 
agacattcta 



atgcaaatgt 
gaaaagctgg 
aatcataggc 
ccaacagaat 
gttcttatgc 
ctattatgtg 
ggctttccat 
atgtatattg 
tctttattta 
ctgcatatag 
agacatgaaa 
gactggagtg 



cagattgcac 
agagaatcag 
attgtgaagc 
catcatatac 
agcaaaagat 
tatatccaat 
tcagcaaatt 
ttgtagttaa 
aagaagtttg 
acattctaga 
gaatgttgaa 
atatatcttg 



tgaggccacc 
cacccatcag 
agagatgaga 
aataggaagc 
aatcagaaca 
gactactcac 
ttaagaaatg 
actctttgta 
gttctttaaa 
catgaaagaa 
gtttggttct 
ttcag 



<210> 98 
<211> 213 
<212> DNA 

<213> Hylobates sp. 
<400> 98 

acatgaaaga atgttgaagt ttggttcttt tgtggatact ctgcatatag acattctaga 
ctggagtgat atatcttgtt caggtttcta tgtttttaaa atgataactt gacacaccta 
ccaatgcaaa tttttaggat atatatctgt gtgaactaag gaaaattcag gaatgattaa 
acctacatat tttctcagga gagactggat aat 

<210> 99 

<211> 209 

<212> DNA 

<213> Hylobates sp. 
<220> 

<221> misc__feature 

<222> 26^27 

<223> n=a, g, c or t 

<400> 99 

ggttttgtgc atttctaaga tgatcnnctt ggtgtggggt cctgatcttc ccagtgactt 
agcagcacag taatttggct gaaatgttga tgagttaaga caagaagaag gtgacgctga 
caagagagcg tcaaggggaa attctgccct gaggaaggaa tagtttgagg ctggagagat 
gtgctacccc cgccactctt tctcaaatg 

<210> 100 
<211> 1415 
<212> DNA 

<213> Hylobates sp. 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
715 



60 
120 
180 
213 



60 
120 
180 
209 
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<220> 

<221> mis cofeature 

<222> 343,383,387,570,1118 

<223> n-a, g, c or t 

<400> 100 

gaaagaaaac ctgtaggcac agaaatcatt gctctgcfcta cactgaaatc tcctgctgac 60 

tttgagaggt gagtgttgga ttttgatgga agggtgaaac ttagtgaaga aatgatttga 120 

acctttgttg atcgaatact tctgtgatac agttatctaa acagaatgag tctaaggtca 180 

gtataaagac ctgtgtctct tcgtgttttg ctacttataa ttgcagagga acgttttgag 240 

atagaaaatt ttatcccact ttcaagtgac tttttataaa acataacctt ttaagagagt 300 

tcattctttt gaaaaattgt ctgaatacaa gaaaagtaac tgnaacagca aggattgtag 360 

ccataatgtt cttgaccact ttnagantga caaatagtgt ctcaaaattg atcaggtgcc 420 

acagaatcta tagaggttgt atggagtaac caactgatgg aagaggttgg acacatctga 480 

ataattttat gtggtattat gtgatgttat tttggtgggg tagtgtctgg agctgagcat : 540 

gatggggcct agatggggca gccttctaan gctcccaaca tttcccaaat catatgtcat 600 

ctcaaaattg taatgacaaa gctcacagca gctgcctaca ccaggtttgg tgccagctct 660 

aattagagca tcgagcattt ctctcccctt catgaatgca catgcgtgca cattttcata 720 

tatgagggca aaggacattt caaagtttgg ttggaatctt tcctacttag aagtaataaa 780 

aacaatactt tgaaatctta ccaaatatta aacaaatagc aacaattttt cc.atttaaag 84 0 

gtgtgtgtct tcaggtctgt tggatctgtc aaatttctat tttggacatg gaaaccacag 900 

atgaaattta acttacacaa agttacacag aactcctggc tgatctcttt tacttaattc 960 

ttctaaaacc taaagtaaga ataattttga aattttattg gtaatgctta aataacagtt 1020 

actttaatat attacatgtt tgaagatttt catttatact taatatcgaa tttaacttgt 1080 

gaaaatatta ttcagtattt tgttttgttt tgttttcnaa aatgacttgt tcagggctcc 114 0 

ttttgggatt ttaagtgaga aaaactctat ttcataatat tgttaccatg ctcatccaat 1200 

aacacttcat agatggttag ccatctctaa aattaaattt tgtagttgag ggccattgaa 1260 

aataccacct atattgggat ttctgtttaa cacttcaaaa tgtatcatta tcataattta 1320 

aaattagatt taaatacaag cctctttata gaaaacctga taaatatggc tttatttgtt 1380 

tgcttgtttt ttttgcaggg agggatattc ccttg 1415 

<210> 101 
<211> 112 
<212> DMA 

<213> Gorilla gorilla 
<220> 

<221> nds cofeature 
<222> 99 

<223> n-a, g, c or t 
<400> 101 

tcccagtagg attcaaccac tcatagactg cttttgttca aactgcgttc aaataaggca 60 
aacgccgagc tggaaccaat ctcactgttt ctgtacctng cttctaattc ct 112 

<210> 102 
<211> 118 
<212> DNA 

<213> Gorilla gorilla 
<400> 102 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga gttataaaga 60 
accatctcag aattttactc tcttactgca aaatctcatg caaagataac caagtacg 118 

<210> 103 
<211> 146 
<212> DNA 

<213> Gorilla gorilla 
<400> 103 
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ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 
ttgccatgaa gacttttcca acatgactgc ttacttcttc aagcctgtaa ggagatccag 
tctcagtcgg ccaagtcttg taaaag 

<210> 104 
<211> 94 
<212> DNA 

<213> Gorilla gorilla 



* 60 
120 
14 6 



<400> 104 

gaaaataacg catttataaa acccagctca gatgttactt ccaaccaaca aatttcctgg 60 
acacactaag taatagagtt aatcacaccg tcct 94 



<210> 105 
<211> 229 
<212> DNA 

<213> Gorilla gorilla 
<40O> 105 

ttgcaggtcc acattcgttc tagaggctgc ctgcggtcct tagaggccac tcatagttcc 
ttgccatgaa gacttttcca acatgactgc ttacttcttc aagcctgtaa ggagatccag 
tctcagtcgg ccaagtcttg taaaaggtaa cctcatcacc ggagtcacgt ctcatcacct 
ttcccatgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 



60 
120 
180 
229 



<210> 106 
<211> 205 
<212> DNA 

<213> Gorilla gorilla 



<400> 106 

tgaaggtctc cctgctgtgc ttagagaggg ggacgctgga ataagagtgg tcagagaagc 60 

aacattgctg gctctaaaga cagaggacag gccacgagcc aaggcatgca ggaggcctcc 120 

agaagctggg gcaggcaaga gaatggactc agccctggag cctccagaag gcacacagcc 180 

ctgcccgttc cttgctttta gccag 205 



<210> 107 
<211> 82 
<212> DNA 

<213> Gorilla gorilla 



<400> 107 

aaacaatacc aacagaaaca aagctatgta cttgttttgg gaagcaacat caaatggaaa 
gacgggggct gtgttctaaa ag 

<210> 108 
<211> 234 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 
<222> 192 

<223> n=a, g, c or t 
<400> 108 

gaacattcac tctgggagct ctgtgctgcc atgcaaatgt cacattgcac tgaggccgcc 60 
atgctgtaaa aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 
cttttccagc attttgggtt ctcccagcta aatcattggc attgtgaagc agagatgaga 180 
cctcctgcca tnccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 109 



60 
82 
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<211> 714 
<212> DMA 

<213> Gorilla gorilla 
<220> 

<221> misc_f eature 
<222> 192 

<223> n=a, g, c or t 



<400> 109 

gaacattcac 

atgctgtaaa 

cttttccagc 

cctcctgcca 

ttttacttta 

ctgccatgca 

acacacatta 

aaaaagttat 

aatggtaaaa 

tgaaagaagt 

tgttgaagac 

ttgtggatac 



tctgggagct 
aaaccccaag 
attttgggtt 
tnccctttcc 
agccactaat 
ggagctgata 
cttgcttcac 
tcatgggaaa 
tattaaatta 
ttgggtcttt 
aaagaatgtt 
tctgcatata 



ctgtgctgcG 
ctacctacct 
ctcccagcta 
aaatcctgga 
ttggggtgta 
aacttgctca 
tctgctttct 
ttaactcaag 
ttgtgagtag 
tgtggatacg 
gaaggaatat 
gacattctag 



atgcaaatgt 
gaaaagctgg 
aatcattggc 
ccaacagaat 
gctcttttgc 
ctgttatgta 
ggctttccat 
atgaatattg 
tttttattta 
ttgcatatag 
agacatgaaa 
actggagtga 



cacattgcac 
agagaatcag 
attgtgaagc 
catcatatac 
agcaaaagat 
tgtgtccaat 
tcagcaaatt 
ttgtagttaa 
aagaagtttg 
acattctaga 
gaatgttgaa 
tacatcttgt 



tgaggccgcc 
cacccatcag 
agagatgaga 
aatagtaagc 
aatcagaaca 
gactactcac 
tgaagaaatg 
tctctttgta 
gttctttaaa 
catgaaagaa 
gtttgttttt 
tcag 



<210> 110 
<211> 220 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_f eature 
<222> 122 

<223> n~a/ g, c or t 
<400> 110 

acatgaaaga atgttgaagt ttgttttttt gtggatactc tgcatataga cattctagac 
tggagtgata catcttgttc aggtttctat gtttttaaaa tgataacttg acacacctac 
crxatgccaat ttttagggta tatatctgta tatctatgtg aactaaggaa aattcaggaa 
tgattaaacc tacatatttt gccaggagag accggataat 

<210> 111 
<211> 209 
<212> DNA 

<213> Gorilla gorilla 
<220> 

<221> misc_feature 

<222> 97,115,117,130,134,160,193 

<223> n-a, g, c or t 

<400> 111 

ggttttgtgc atttctaaga tgatcatctt ggtgtggggt cctgatcttc ccagtgactt 
agcagcacag taatttggct gaaatgttga tgagttnaga caagaagaag gtgangntga 
caagagagcn tcanggggaa attctgccct gaggaaggan tagtttgagg ctggagagat 
gtgctacccc cgncactctt tctcaaatg 

<210> 112 
<211> 1105 
<212> DNA 

<213> Gorilla gorilla 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
714 



60 
120 
180 
220 



60 
120 
180 
209 
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<221> mis cofeature 
<222> 959 

<223> n^a, g, c or t 



<400> 112 

gaaagaaaac 

tttgagagct 

acctttgttg 

gtataaagac 

atagaaaatt 

tcattctttt 

ccataatgtt 

gagaatctat 

tttatgtggt 

ggccctagat 

aaattgtaat 

agagcatcga 

agggcaaagg 

aatactttga 

tgtgtottaa 

aaatttaact 

taaaatctaa 

actttaatat 

gaaaatatta 



ctgtaggcac 

gagcattgga 

atctaatact 

ctgtgtctct 

ttatccccct^ 

gaaaaattgt 

cttgaccact 

tgaggttgta 

attatgtgat 

ggggcagcct 

gacaaagctc. 

tcatttctca 

acatttcaaa 

aatcttacca 

ggtctgttgg 

gacacaaagt 

agtaagaata 

gttacatgtt 

ttcagtattt 



agaaatcatt 
ttttgatggg 
tctgtgatac 
cc:atgttttg 
ttcaagtgac 
ctgaatacaa 
ttagattgac 
tggaataacc 
gtttttttgg 
tctaatgctc 
acagcagctg 
cgccttcatg 
gtttggatgg 
aatattaaat 
atctttcaaa 
tacacagaac 
attttgaaat 
tgaagtttca 
ttttt 



gctctgctta 
agagtgaaac 
agttatctaa 
ctacttataa 
tttttataaa 
gagaagtaac 
aaatagtgtc 
aagtgatgga 
tgggggtagt 
ccagcatttc 
cctacgccag 
aatgcacatg 
aatctttcct 
aaatagcaac 
tttctatttt 
tcctggctga 
tttattggta 
tttatactta 



taetgaaatc 
ttagtgaaga 
acagaatgag 
ttgcagagga 
gcgtaacctt 
tgaacagcaa 
tcaaaattga 
agaggttgga 
gtctggagct 
gcaaatcata 
gtttggtgcc 
tgtgcacatt 
acttagaagt 
aatttttcca 
ggacatggaa 
tctgttttac 
atgcttaaat 
atatctaatt 



tcctgctgac 
aatgatttga 
tctaaagtca 
acattttgag 
ttaagagagg 
gggattgtag 
tcaggttccg 
agtgaataat 
gagcgtgatg 
tgtcacgtca 
agctctaatt 
ttcatatata 
aataaaaaac 
tttaaagatg 
actacagatg 
ttaattctnc 
aacaaaagtt 
taactcttgt 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
720 
780 
840 
9Q0 
960 
1020 
1080 
1105 



<210> 113 
<211> 112 
<212> DNA 

<213> Pongo pygmaeus 



<220> 

<221> misc_feature 
<222> 69. *71 r 99,106 
<223> n=a, g, c or t 

<400> 113 

tcccagtggg attcaaccac tcatagactg cttttgttcg aactgcgttc aaataaggca 60 
aacgccgann nggaaccaat ctcactgttt ctgtacctna cttatnattc ct 112 

<210> 114 
<211> 118 
<212> DNA 

<213> Pongo pygmaeus 



<400> 114 

tttcagaaaa caaaaactcc taagagaatg aatagaaatt atcttaagga attataaaga 60 
accgtctcag aattttcctc tgttactgca aaatctcatg caaagatgac caagtatg 118 

<210> 115 
<211> 146 
<212> DNA 

<213> Pongo pygmaeus 
<400> 115 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggtcac tcatagttca 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120 
tctcagttgg ccaaatcttg tataag 14 6 



<210> 116 
<211> 94 
<212> DNA 
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<213> Pongo pygmaeus 
<4 00> 116 

ggaaataacg catttataaa acccagctca gatattactt ccaaccaaca aattgcctgg 60 
acacactaag taatagagtt aatcacacca tctt 94 

<210> 117 
<211> 229 
<212> DMA 

<213> Pongo pygmaeus 
<400> 117 

ttgcaggtcc acattcgttc tagaggctgc ctgcagtcct tagaggtcac tcatagttca 60 
ttgccatgag gacttttcca acatggctgc ttacttcttc aagcctgtaa ggagatctgg 120' 
tctcagttgg ccaaatcttg tataaggtaa cctcatcacc ggagtcatgt ctcatcacct 180 
ttcccgtgtt ccattggtta gaagtcccag ctcctgccca cattcaagg 229 

<210> 118 
<211> 234 
<212> DNA 

<213> Pongo pygmaeus 
<400> 118 

gaacattcac tctgagagct ctgtgctgcc atgcaaatgt cagattgcac taaggccgtc 60 
atgctgtaag aaaccccaag ctacctacct gaaaagctgg agagaatcag cacccatcag 120 
cttttccagc cctttgggtc ctcccagcta aatcataggt attgtgaagc agagatgaga 180 
cctcctgcca tgccctttcc aaatcctgga ccaacagaat catcatatac aata 234 

<210> 119 
<211> 599 
<212> DMA 

<213> Pongo pygmaeus 



<400> 119 

gaacattcac 

atgctgtaag 

cttttccagc 

cctcctgcca 

ttttacttta 

ctgccatgca 

acacacatta 

aaaatgttat 

aatggtaaaa 

tgaaagaagt 



tctgagagct 
aaaccccaag 
cctttgggtc 
tgccctttcc 
agccactaat 
ggagctgata 
cttgcttcac 
tcatgggaaa 
tattaaatta 
ttgggtcttt 



ctgtgctgcc 
ctacctacct 
ctcccagcta 
aaatcctgga 
ttggggtgta 
aacttgttca 
tctactttct 
ttaactcaag 
ttgtgagtca 
tgtggatact 



atgcaaatgt 
gaaaagctgg 
aatcataggt 
ccaacagaat 
gttcttttgc 
ctgttatgta 
ggctttccat 
atgaatattg 
cttttattta 
ctgcatatag 



cagattgcac 
agagaatcag 
attgtgaagc 
catcatatac 
agcaaaagat 
tgtatcc^at 
tcagcaaatt 
ttgtagttaa 
aagaagtttg 
accttctaga 



taaggccgtc 
cacccatcag 
agagatgaga 
aatagtaagc 
aatcagaaca 
gactactcac 
ttaagaaatg 
tctctttgtc 
gttctttaaa 
ctggagtga 



60 
120 
180 
•240 
300 
360 
420 
480 
540 
599 



<210> 120 
<211> 198 
<212> DNA 

<213> Pongo pygmaeus 
<400> 120 

gaagtttggg tcttttgtgg atactctgca tatagacctt ctagactgga gtgatatatc 
ttgttcaggt ttctatgttt ttaaaatgat aacctgacac acctaccatt gccaattttt 
aggatatata tctgtgtgaa ctaaggaaag ttcaggaatg attaaaccta catattttgc 
caggagagac tggataat 



60 
120 
180 
198 



<210> 121 
<211> 152 
<212> DNA 

<213> Pongo pygmaeus 
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<220> 

<221> raisc_f eature 
<222> 73 

<223> n=a, g, c or t 
<400> 121 

tgacttagca gcacagtaat ttggctgaaa tgttgatgag ttcagacaag aaggtgsrtga 60 
tgacaagaca gcntcacggg gaaattctgc cctgaggaag gaatagtttg aggctggaga' 120 
gatgtgctac ccccgccact ctttgtcaaa tg 152 

<210> 122 
<211> 119 
<212> DNA 

<213> Macaca mulatta 



<220> 

<221> misc_feature 
<222> 70,81,83,85,88,93 
<223> n^a, g, c or t 



<400> 122 

tgtcagaaaa caaaaactcc taagagaatt aatagaaatt atcttaagga attataaaga 60 
acccatctcn cagaaattta ntntnttnac ggnaaaatct catgcaaaga taaccaagt 119 

<210> 123 
<211> 146 
<212> DNA 

<213> Macaca mulatta 



<400> 123 

tagtacgtcc acattcattc tagatgctgc ctgaagtcct tagatgcctc tcatagttcc 60 

ttgccatgag cacttttcca acatggctga tgactgcttc aagcctgtaa ggagatctgg 120 

tctcagttgg ccaagtcttg gataat 14 6 

<210> 124 
<211> 74 
<212> DNA 

<213> Macaca mulatta 



<220> 

<221> mi$c_feature 
<222> 6,37,48,56 
<223> n^a, g, c or t 



<400> 124 

ctcagnatat tacctccaac ctacaaattt cctgggnaca cactaagnta ataagnagtt 60 
aatcacaccg tcct 74 

<210> 125 
<211> 229 
<212> DNA 

<213> Macaca mulatta 



<400> 125 

tagtacgtcc acattcattc tagatgctgc ctgaagtcct tagatgcctc tcatagttcc 60 

ttgccatgag cacttttcca acatggctga tgactgcttc aagcctgtaa ggagatctgg 120 

tctcagttgg ccaagtcttg gataatgtaa cctcatcacc ggagtcacgt ctcatcacct 180 

ttcccatgtt ccattggtta gaagtcccag cttctgccca cattcaaga 229 

<210> 126 
<211> 202 
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<212> DMA 

<213> Macaca mulatta 
<220> 

<221> misc_feature 
<222> 127. .128,130*. .131 
<223> n=a f g, c or t 

<400> 126 

tgaaggtctc cctgctgtgc tgagagaggg gcatgctgga agaagagtgg tcagagaagc 60 

aacattgctg gctctaaaga cagaggacag gccatgaacc aaggcatgca ggcggcctcc 120 

gaaagcnngn ncaggcaaga gaatggactc tgccctggag cctccagaac gcgtgcagcc 180 

ctgcccattc cttgctttta gc 202 

<210> 127 
<211> 82 
<212> DNA 

<213> Macaca mulatta 
<220> 

<221> misc_feature 
<222> 6,8,30,52,72 
<223> n*=a, g, c or t 

<4 00> 127 

aaacantncc aacagaaaca aagctttgtn cttgttttgg gaagcaacct cnaatggtaa 60 
gatgggggct gngttctaaa ag 32 

<210> 128 
<211> 243 

<212> DNA ' 
<213> Macaca mulatta 



<210> 129 
<211> 727 
<212> DNA 

<213> Macaca mulatta 
<4O0> 129 

gaacattcac cgtggctgct ctgcgctgcc atacaatgtc agattgcact gaggccgcca 60 

tgctgtaaga aaccccaagc tgcctgcctg aaaagactgg agagacacag cacccatcag 120 

cttttccagc cttttgggtc ctcccagcta aattacaggc attgtgaagc agagatgaga 180 

agagatgaga cctcctgcgt gccctttcca aatcccggct cagcagaatc atcctacaca 240 

atagtaagtt tttacttgaa gccact'aatt tggggagtag ttcttatgca gcaaaagata 300 

atcagaacac tgcgccatgg caggagctga taaacctgct cactgttacg cgtgtatcca 360 

atgactactc acacacacat gacttgcttc actctacttt ctggctttcc actcagcaaa 420 

ttttaagaaa tgagaatgtt attcatgaga aattaactca agatgaatat tgttatcgtt 4 80 

aaactcttcg taaatggtaa aatactacat tattgctagt catttttatt taaagaagtt 540 

tggttcttta aatgaaagaa gtttgggtct tttgtggata ctctgcatat agatattcta 600 

gacatgaaag aacgttgaag acaaagaatg ttgaaggcat atagacatga aggaattttg 660 

aagttgggtt cttttgtgga tactctgcat atagacattc tagactggag tgatatgtct 720 

tgttcaa 727 

<210> 130 



<4O0> 128 



gaacattcac cgtggctgct ctgcgctgcc atacaatgtc agattgcact gaggccgcca 
tgctgtaaga aaccccaagc, tgcctgcctg aaaagactgg agagacacag cacccatcag 
cttttccagc cttttgggtc ctcccagcta aattacaggc attgtgaagc agagatgaga 
agagatgaga cctcctgcgt gccctttcca aatcccggct cagcagaatc atcctacaca 
ata 



60 
120 
180 
240 
243 
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<211> 214 
<212> DMA 

<213> Macaca mulatta 



<400> 130 

acatgaagga attttgaagt tgggttcttt tgtggatact ctgcatatag acattctaga 60 

ctggagtgat atgtcttgtt caagcttcta tgtttttaaa atgataactt gacacaccta 120 

cccaatgcca atttttagga cataaatctg tgtgaactaa ggaaaattca ggaatgatta 180 

aacctacata ttttcccaag agagactgga taac 214 

<210> 131 
<211> 77 
<212> DNA 

<213> Macaca mulatta 



<400> 131 

agagcgfcgac ggggaaattc tgccctgagg aaggaatagt ttgaggctgg agagatgtgc 60 
tacccccgcc actcttt 77' 

<210> 132 
<211> 306 
<212> DNA 

<213> Macaca .mulatta 



<400> 132 

taaaatgact 

taatgttacc 

ttttgtagtt 

aaatccatta 

tgataaatgt 

ccactc 



tgttcagggc 
acgctaatcc 
gaggagcatg 
ttatcataat 
gattttattt 



tccttttggg 
aataacactt 
gaaaatacca 
ttaaaattag. 
gtttgcttgt 



attttaagtg 
catagatggt 
cctatatttg 
atttaaatac 
ttttttgcag 



agaaaaactc 
tagccatctc 
gatttctgtt 
aaacctcttt 
ggagggatat 



tatttcataa 
taaaattaaa 
taacacttca 
atagaaaagc 
tcccttgata 



60 
120 
180 
240 
300 
306 



<210> 133 
<211> 18 
<212> DNA 

<213> Artificial Sequence 
<220> 

<223> sequencing oligonucleotide PrimerPU 
<400> 133 

tgtaaaacga cggccagt 18 

<210> 134 
<211> 18 
<212> DMA 

<213> Artificial Sequence 
<220> 

<223> sequencing oligonucleotide PrimerRP 

<400> 134 ' 
caggaaacag ctatgacc .18 
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